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WEIGHT CONTROL PATHWAY GENES AND USES THEREFOR 

Background of the Invention 

Obesity represents the most prevalent of body weight disorders, and it is the most 
important nutritional disorder in the western world, with estimates of its prevalence 
ranging from 30% to 50% within the middle-aged population. Other body weight 
disorders, such as anorexia nervosa and bulimia nervosa which together affect 
approximately 0.2% of the female population of the western world, also pose serious 
health threats. Further, such disorders as anorexia and cachexia (wasting) are also 
prominent features of other diseases such as cancer, cystic fibrosis, and AIDS. 

Obesity, defined as an excess of body fat relative to lean body mass, also 
contributes to other diseases. For example, this disorder is responsible for increased 
incidences of diseases such as coronary artery disease, hypertension, stroke diabetes, 
hyperlipidemia and some cancers. (See, e.g., Nishina, P.M. et al. (1994) Metab. -/J.-554- 
558; Grundy, S.M. and Barnett, J.P. (1990) Dis. Mon. 36:641-731) Obesity is not 
merely a behavioral problem, i.e., the result of voluntary hyperphagia. Rather, the 
differential body composition observed between obese and norma! subjects results from 
differences in both metabolism and neurologic/metabolic interactions. These differences 
seem to be, to some extent, due to differences in gene expression, and /or level of gene 
20 products or activity (Friedman, J.M. et al.( 1 991 ) Mammalian Gene I: 1 30- 1 44). 

The epidemiology of obesity strongly shows that the disorder exhibits inherited 
characteristics (Stunkard (1990) N. Eng. J. Med 522:1483). Moll et al. have reported 
that , in many populations, obesity seems to be controlled by a few genetic loci (Moll et 
al. (1991) Am J. Hum. Gen. V9:1243). In addition, human twin studies strongly suggest 
a substantial genetic basis in the control of body weight, with estimates of heritability of 
80-90% (Simopoulos. A. P. and Childs B., eds., 1989, in "Genetic Variation and 
Nutrition in Obesity", World Review of Nutrition and Diabetes 63, S. Karger, Basel, 
Switzerland; Borjeson, M., 1 976, Acta. Paediatr. Scand. 65:279-287). 

Studies of non-obese persons who deliberately attempted to gain weight by 
systematically over-eating were found to be more resistant to such weight gain and able 
to maintain an elevated weight only by very high caloric intake. In contrast, 
spontaneously obese individuals are able to maintain their status with normal or only 
moderately elevated caloric intake. In addition, it is a commonplace experience in 
animal husbandry that different strains of swine, cattle, etc.. have different 
predispositions to obesity. Studies of the genetics of human obesity and of models of 
animal obesity demonstrate that obesity results from complex defective regulation of 
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both food intake, food induced energy expenditure and of the balance between lipid and 
lean body anabolism. 

There are a number of genetic diseases in man and other species which feature 
obesity among their more prominent symptoms, along with, frequently, dysmorphic 
5 features and mental retardation. For example, Prader-Willi syndrome (PWS; reviewed 
in Knoll, J.H. et al. (1993) Am. J. Med Genet. 46:2-6) affects approximately 1 in 20,000 
live births, and involves poor neonatal muscle tone, facial and genital deformities, and 
generally obesity. 

In addition to PWS, many other pleiotropic syndromes which include obesity as 

10 a symptom have been characterized (e.g. Ahlstroem, Carpenter, Bardet-Biedl, Cohen, 
and Morgagni-Stewart-Monel Syndromes). These syndromes are more genetically 
straightforward and appear to involve autosomal recessive alleles. 

A number of models exist for the study of obesity (see, e.g.. Bray, G.A. (1992) 
Prog, Brain Res. £3:333-341, and Bray, G. A. (1989) Amer. J. Clin. Nutr. 5:891-902). 

1 5 For example, animals having mutations which lead to syndromes that include obesity 
symptoms have also been identified. Attempts have been made to utilize such animals 
as models for the study of obesity, and the best studied animal models, to date, for 
genetic obesity are mice. For reviews, see e.g., Friedman, J.M. et al. ( 1 991 ) Mamm. 
Gen. 1:130-144; Friedman, J.M. and Liebel, R.L. (1992) Cell 69:217-220. 

20 Studies utilizing mice have confirmed that obesity is a very complex trait with a 

high degree of heritability. Mutations at a number of loci have been identified which 
lead to obese phenotypes. These include the autosomal recessive mutations obese (ob\ 
diabetes (db). fat (fat) and tubby (tub). In addition, the autosomal dominant mutations 
Yellow at the agouti locus and Adipose (Ad) have been shown to contribute to an obese 

25 phenotype. 

The ob and db mutations are on chromosomes 6 and 4, respectively, but lead to 
clinically similar pictures of obesity, evident starting at about one month of age, which 
include hyperphagia. severe abnormalities in glucose and insulin metabolism, very poor 
thermoregulation and non-shivering thermogenesis, and extreme torpor and 

30 underdevelopment of the lean body mass. 

The ob gene and its human homologuc have recently been cloned (Zhang, Y. et 
al., (1994) Nature 372:425-432). The gene appears to produce a 4.5 kb adipose tissue 
messenger RNA which contains a 167 amino acid open reading frame. The predicted 
amino acid sequence of the ob gene product indicates that it is a secreted protein and 

35 may, therefore, play a role as part of a signaling pathway from adipose tissue which may 
serve to regulate some aspect of body fat deposition. 
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The db locus encodes a high affinity receptor for the ob gene product (Chen H 
et al. Cell 84:491 -495). The db gene product is a single membrane-spanning receptor ' 
most closely related to the gp!30 cytokine receptor signal transducing component 
(Tartagha, L.A. etal. (1995) Cell 83:1263-1271). 
5 Homozygous mutations at either the/*, or tub loci cause obesity which develops 

more slowly than that observed in ob and db mice (Coleman, D.L., and Eicher E M 
(1 990) J. Heredity 81 :424-427). with tub obesity developing slower than that observed 
m fai ammals. This feature of the tub obese phenotype makes the development of tub 
obese phenotype closest in resemblance to the manner in which obesity develops in 
10 humans. Even so, however, the obese phenotype within such animals can be 

charactenzed as massive in that animals eventually attain body weights which are nearly 
two tunes the average weight seen in normal mice, tub/tub mice develop insulin 
res.stance with their weight gain but do not progress to overt diabetes 

In addition to obesity, retinal defects, hearing loss and infertility have all been 
15 observed in tub mice (Heckenlively, 1988, in Retinitis Pigmentosa, Heckenlively ed 

L, Ppi ncott, Philadelphia, pp. 221-235; Coleman, D.L.& Eicher, E.M., 1990 J Hered 
81:424-4a27; Ohlemiller, K.K. etal. (1995) AW e/W r, 6:845-849). Several human 
syndromes exist in which such defects are found to co-exist with an obesity phenotype 
mcluding Bardet-Bied. syndrome, Ahlstroem syndrome, polycystic ovarian disease and 
20 Usher's syndrome. 

The/a, mutation has been mapped to mouse chromosome 8, while the tub 
mutafon has been mapped to mouse chromosome 7. According to Naggert et al the fat 
mutanon has recently been identified (Naggert, J.K., et al. (1995) Nature Genetics 
10:135-141). Specifically, the fat mutation appears to be a mutation within the Cpe 
locus, wh,ch encodes the carboxypeptidase (Cpe) E protein. Cpe is an exopeptidase 
involved ,n the processing of prohormones, including proinsulin. 

The dominant Yellow mutation at the agouti locus, causes a pleiotropic 
syndrome wh.ch causes moderate adult onset obesity, a yellow coat color, and a high 
mcdence of tumor formation (Herberg, L. and Coleman, D.L. (1 977) Metabolism 
26:59), and an abnormal anatomic distribution of body fat (Coleman. D.L ( 1 978) 
Diabetologia 14:141-148). This mutation may represent the only known example of a 
plc.otrop,c mutation that causes an increase, rather than a decrease, in body size The 
mutanon causes the widespread expression of a protein which is normally seen only in 
neonatal skin (Michaud, EJ. et al. (1994) Genes Devel. 8:1463-147-7) 

,h ,/ TVl™ 1 m ° dC,S M/a//b < ftt *> rats < w ™<* bear many similarities to 
the ob/ob and db/db mice, discussed above. One difference is that, ^ fa/fa rats are 
very sensitive to cold, their capacity for non-shivering thermogenesis is normal. Torpor 
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seems to play a larger part in the maintenance of obesity in fa/fa rats than in the mice 
mutants. In addition, inbred mouse strains such as NZO mice and Japanese KK mice are 
moderately obese. Certain hybrid mice, such as the Wellesley mouse, become 
spontaneously fat. Further, several desert rodents, such as the spiny mouse, do not 
5 become obese in their natural habitats, but do become so when fed on standard 
laboratory feed. 

Animals which have been used as models for obesity have also been developed 
via physical or pharmacological methods. For example, bilateral lesions in the 
vetromedial hypothalamus (VMH) and ventrolateral hypothalamus (VLH) in the rat are 

10 associated, respectively, with hyperphagia and gross obesity and with aphagia, cachexia 
and anorexia. Further, it has been demonstrated that feeding monosodiumglutamate 
(MSG) or gold thioglucose to newborn mice also results in an obesity syndrome. 

In summary, therefore, obesity, which poses a major, worldwide health problem, 
represents a highly heritable trait. Given the severity, prevalence and potential 

1 5 heterogeneity of such disorders, there exists a great need for the identification genes 
involved in the control of body weight. 

Summary of the Invention 

The present invention is based on the discovery of novel molecules, referred to 

20 herein as "Tub Interactor" ('TT) nucleic acid and polypeptide molecules. Exemplary 
novel TI molecules are contained in and encoded by: 1) E. coli plasmid ptyhq049, which 
was deposited with the American Type Culture Collection (ATCC) on August 6, 1996 
and has been assigned ATCC designation number 98125; 2 E. coli plasmid ptyhq054, 
which was deposited with the American Type Culture Collection (ATCC) on August 6, 

25 1996 and has been assigned ATCC designation number 98126; 3) E. coli plasmid 

ptyhq058, which was deposited with the American Type Culture Collection (ATCC) on 
August 6, 1996 and has been assigned ATCC designation number 98127; and 4) E coli 
plasmid ptyhq036, which was deposited with the American Type Culture Collection 
(ATCC) on August 6, 1 996 and has been assigned ATCC designation number 98128. 

30 Six novel TI genes were cloned and identified based on their ability to interact 

with the C-terminus (i.e. the last 44 amino acids) of htuh in a two hybrid assay as further 
described in the following Examples. hTI-1 (Figure 1 (SEQ ID NO.l)) is a 1386 base 
pair nucleic acid encoding a serine protease. hTI-2 (Figure 2 (SEQ ID NO:2)) is a 2103 
base pair nucleic acid containing ANK (i.e, ankyrin) repeats. hTI-3 (Figure 3 (SEQ ID 

35 NO:3)) is a 1 048 base pair nucleic acid containing TPR repeats (i.e. tetralicopeptide 
repeats) and also DNAJ repeats. mTI-3 (Figure 4 (SEQ ID NO:4)) is a 1700 base pair 
nucleic acid that is the murine homologue of hTl-3. hTl-4 (Figure 5 (SEQ ID NO:5)) is 



WO 98/12302 



PCT/US97/15627 



10 



a 1 42 1 base pair nucleic acid that contains RING finger repeats and also Zinc finger 
repeats. mT,-4 (Figure 6 (SEQ ID NO*)) is a 2 12 1 base pair nucleic acid that is the 
munne homo.ogue of hTI-4. A final T, gene (hT,- 5) w*s .dentified as encoding human 
scnne palm.toyltransferase (GenBank Accession No. UI5555). 

In one aspect, the invention features isolated vertebrate 77 nucleic acid 
nucules. The disclosed molecules can be non-coding, (e.g. probe, antisense or 
nbozyme molecules) or can encode a functional 77 polypeptide (e.g. a polypeptide 
wh.ch specially modulates, e.g., by acting as either an agonist orlt^ Ls, 
one b.oacnvny of the human TI polypeptide). ,n one embodiment, the nucleic acid 

ZTTsm 8 2 ^ ™f 8ene C ° mained ^ ^ ° f ATCC **"»*« numbers 

of ATCC d " w t0 C ° mP,ement ° f thC 11 *" COntai ^ in any 

of ATCC des.gnat.on numbers 98125, 98126, 981257. or98128. In another 

embodiment, the nucleic acids of the present invention can hybridize to a vertebrate 77 
geneortothecomplementofave rt ebrater7 g ene. In a further embodiment the 
15 claimed nucleic acid can hybrid.ze with the nucleic acid sequence, designated in SEQ 
ID NOs.i 6 or to the complement to the nucleic acid sequence designated in SEQ ID 
NOs:,-6. In a preferred embodiment, the hybridization is conducted under mildly 
stringent or stnngent conditions. 

In further embodiments, the nucleic acid molecule is a 77 nucleic acid that is at 
least 70%, preferably 80%, more preferably RS°/ * a 

or Qso/ u n , Prelerably 85 /„, and even more preferably at least 90% 

95 /„ homologous .n sequence to any of the nucleic acids shown as SEQ ID NOsl-6 
or to the complement of the nucleic acid shown as SEQ ID NOs:l-6 In a further ' 
embodiment, the nucleic acid molecule is a 77 nucleic acid that is at least 70% 
preferably 80%, more preferably 85% and even more preferably at least 90% or 95% 

% TmTZT IT con,ained in ■* of ATCC designation 

of ATCC H ' " ^ " l ° C ° mP,ement ° f thC »» any 

of ATCC des.gnat.on numbers 98125, 98126, 981257 or 98128 

The invention also provides probes and primers' comprising substantially purified 
ohgonucleot.de, which correspond to a region of nucleotide sequence which hy bridizls 
to at leas, 6 consecutive nucleotides of any of the sequences set forth as SEQ ID No" 
6 or complements of any of the sequences set forth as SEQ ID NO, . -6 or naturally ' 
occu mutants thereof. In preferred embodiment, the probe/primer further 
a label group attached thereto, which is capable of being detected 

For expression, the subject nucleic acids can .nclude a transcriptional regulatory 
equence e .g. at least one of a transcriptional promoter (e.g.. f or cons Live ex pre In 
or mducblc express.on) or transcriptiona. enhancer sequence, which regulatory 
sequence , operably linked to the gene sequent Such regu.atory sequences! 
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conjunction with a 77 nucleic acid molecule can provide a useful vector for gene 
expression. This invention also describes host cells transfected with said expression 
vector whether prokaryotic or eukaryotic and in vitro (e.g. cell culture) and in vivo (e.g. 
transgenic) methods for producing 77 proteins by employing said expression vectors. 
5 In another aspect, the invention features isolated 77 polypeptides, preferably 

substantially pure preparations e.g. of plasma purified or recombinantly produced 
polypeptides. In preferred embodiments, the polypeptide is able to bind to the C- 
terminus (e.g. the last 44 amino acids) of the human tub protein. In particularly 
preferred embodiments, the subject polypeptides, whether agonists or antagonists, can 

10 suppress the development and/or progression of a weight disorder (obesity, cachexia or 
anorexia nervosa) or a related disorder (e.g. diabetes). 

In a preferred embodiment, the 77 polypeptide is encoded by a nucleic acid 
which hybridizes with any of the nucleic acid sequences represented in SEQ ID NOs: 1-6 
or with the gene or gene fragment contained in any of ATCC Designation Nos. 

15 designation numbers 98125, 98126, 981257, or 98128. The subject 77 proteins also 
include modified protein, which are resistant to post-translational modification, as for 
example, due to mutations which alter modification sites (such as tyrosine, threonine, 
serine or aspargine residues), or which prevent glycosylation of the protein, or which 
prevent interaction of the protein with intracellular proteins involved in signal 

20 transduction. 

The 77 polypeptides can comprise a full length protein or it can comprise a 
fragment corresponding to one or more particular motifs/domains, or to arbitrary sizes, 
e.g., at least 5, 10, 25, 50, 100, 150 or 200 amino acids in length. In preferred 
embodiments, the polypeptide includes a sufficient portion of the domain that interacts 

25 with the C-terminus (i.e. the last 44 amino acids) of normal human tub. 

Another aspect of the invention features chimeric molecules (e.g. fusion 
proteins) comprised of a TI protein. For instance, the TI protein can be provided as a 
recombinant fusion protein which includes a second polypeptide portion, e.g., a second 
polypeptide having an amino acid sequence unrelated (heterologous) to the 77 

30 polypeptide, (e.g. the second polypeptide portion is glutathione-S-transferase, an 
enzymatic activity such as alkaline phosphatase or an epitope tag). 

Yet another aspect of the present invention concerns an immunogen comprising 
a 77 polypeptide in an immunogenic preparation, the immunogen being capable of 
eliciting an immune response specific for a 77 polypeptide: e.g. a humoral response, an 

35 antibody response and/or cellular response. In a preferred embodiment, the immunogen 
comprises an antigenic determinant, e.g. a unique determinant of a protein encoded by 
any of the nucleic acids SEQ ID NOs: 1-6. 
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A still further aspect of the present invention features antibodies and antibody 
preparations specifically reactive with an epitope of a 77 protein. 

The invention also features transgenic non-human animals which include (and 
preferably express) a heterologous form of a 77 gene described herein, or which 
5 m.sexpress an endogenous 77 gene (e.g., an animal in which expression of one or more 
of the subject TYproteins is disrupted). Such a transgenic animal can serve as an animal 
model for studying cellular and tissue disorders comprising mutated or mis-expressed 77 
alleles or for use in drug screening. Alternatively, such a transgenic animal can be useful 
tor expressing recombinant 77 polypeptides. 
1 0 The invention also pertains to methods for identifying a compound or agent 

wh,ch mteracts with (e.g., binds to) a 77 polypeptide. These methods can include the 
steps of contacting the 77 polypeptide with the compound or agent under conditions 
wh,ch allow binding of the compound to the 77 polypeptide to form a complex and 
detectmg the formation of a complex of the 77 polypeptide and the compound in which 
the abUity of the compound to bind to the TI polypeptide is indicated by the presence of 
the compound in the complex. 

In yet another aspect, the invention provides assays, e.g., for screening test 
compounds to identify modulators (e.g., inhibitors, or alternatively, potentiators) of an 
mteraction between a TI protein and, for example, a tub polypeptide. An exemplary 
method mcludes the steps of (i) combining a TI protein or bioactive fragment thereof, a 
77 protein target molecule (such as Tub), and a test compound, e.g., under conditions' 
wherein, but for the test compound, the 77 protein and target molecule are able to 
interact; and (ii) detecting the formation of a complex which includes the TI protein and 
the target polypeptide either by directly quantitating the complex, by measuring 
inductive effects of the 77 protein, or, in the instance of a substrate, measuring the 
conversion to product. A statistically significant change, such as a decrease, in the 
mteraction of the TI protein and target molecule in the presence of a test compound 
(relative to what is detected in the absence of the test compound) is indicative of a 
modulation (e.g., inhibition or potentiation of the interaction between the TI protein and 
30 the target molecule). 

Yet another aspect of the present invention concerns a method for modulating 
apoptosis in a cell by modulating 77bioactivity, (e.g., by potentiating or disrupting 
certain protein-protein interactions). In general, whether carried out /* v,V„. in vitro or 
m suu, the method comprises treating the cell with an effective amount of a 77 
35 therapeutic so as to alter, relative to the cell in the absence of treatment, lipid uptake by 
the cell. Accordingly, the method can be carried out with TI modulating agents such as 
pept.de and peptidomimetics or other molecules identified in the above-referenced drug 
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screens which agonize or antagonize the effects of signaling in a biochemical pathway 
involving a 77 protein. Other modulating agents for use as therapeutics include 
antisense constructs for inhibiting expression of 77 proteins, and dominant negative 
mutants of 77 proteins which competitively inhibit ligand interactions upstream and 
5 signal transduction downstream of the wild-type 77 protein. 

A further aspect of the present invention provides a method of determining if a 
subject is at risk for a disorder characterized by inappropriate 77 protein expression, 
such as, for example, a weight disorder (e.g. obesity, cachexia or anorexia nervosa) or a 
related disorder, such as diabetes. The method includes detecting, in a tissue of the 

10 subject, the presence or absence of a genetic lesion characterized by at least one of (i) a 
mutation of a gene encoding a 77 protein, e.g. represented in any of SEQ ID NOs: 1 -6 or 
a homologuc thereof; or (ii) the mis-expression of a 77 gene. In preferred embodiments, 
detecting the genetic lesion includes ascertaining the existence of at least one of: a 
deletion of one or more nucleotides from a 77 gene; an addition of one or more 

15 nucleotides to the gene, a substitution of one or more nucleotides of the gene, a gross 
chromosomal rearrangement of the gene; an alteration in the level of a messenger RNA 
transcript of the gene; the presence of a non-wild type splicing pattern of a messenger 
RNA transcript of the gene; a non-wild type level of the protein; and/or an aberrant level 
of soluble 77 protein. 

20 For example, detecting the genetic lesion can include (i) providing a 

probe/primer comprised of an oligonucleotide which hybridizes to a sense or antisense 
sequence of a TI gene or naturally occurring mutants thereof, or 5* or 3' flanking 
sequences naturally associated with the 77 gene; (ii) contacting the probe/primer to an 
appropriate nucleic acid containing sample; and (iii) detecting, by hybridization of the 

25 probe/primer to the nucleic acid, the presence or absence of the genetic lesion; e.g. 
wherein detecting the lesion comprises utilizing the probe/primer to determine the 
nucleotide sequence of the TI gene and, optionally, of the flanking nucleic acid 
sequences. For instance, the primer can be employed in a polymerase chain reaction 
(PCR) or in a ligation chain reaction (LCR). In alternate embodiments, the level of a 77 

30 protein is detected in an immunoassay using an antibody which is specifically 
immunoreactive with the 77 protein. 

Other features and advantages of the invention will be apparent from the 
following detailed description and claims. 
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Brief Descrip tion of the Drawing s 

Figure 1 shows the DNA sequence of a novel human 77 gene, £ coli pJasmid 
Ptyhq049, ATCC designation no. 98125 (hTI-1) (SEQ ID NO. I) and a deduced amino 

acid sequence. 

5 p.vho05?lTrr? " **■ ° NA ° f 3 " 0Vel huraan * 

ptyhqOSS, ATCC destgnauon no. 98127 (hTl-2) (SEQ ID NO:2) and a deduced amino 

acid sequence. 

10 a"ncl " " 9 " M (hTM) <SEQ ' D N0:3) a " d 2 ded ^ d — » 

< shows the DNA sequence of a novel murine 77 gene, E coli plasmid 
p.yh.10, (mfOXSEQiDNO^andadeducedaminoacidseoLe 

PtyhqOS^CC d 0WS "* ^ ° f 3 hUma " " 8Cne ' £ «* ^ 

p.yhq054, ATCC des, gn a,,„„ no. 98 1 26 (hTT-4) (SEQ ID NO:5) and a deduced amino 

15 acid sequence. 

« shows Ihe DNA sequence of a novel murine 77 gene, £ coli plasmid 
p,yh, 1 02(mT,-4) (S EQ,DNO : 6,a„daded U cedami„ O acidseqLce. 

Detailed DesrHptj on Q f the Invpntinr, 

as the Tub mtemctor- or -77" genes, which funcion in biochemical pathways invo.ved 
■n weight control and/or related disorder, such as diabetes. 

S » »<>™1 "Senes were cloned and identified based on their ability ,„ m^c 
with the C-termmus (i e. the last 44 amino acids) of M i„ a , w0 hybrid assay 3, „„„„ 
desenbedm the following Examples. hTl-I isa 1386 base pair nucleic acid the 
sequence of which is presented in Figure I (SEQIDNO : |). Based on sequence 
analysts, the polypeptide encoded by the gene is a putative serine protease 
v ^"iL" * 2miax pair mckk «*>• *« seq««nce of which is presented in 

22 f m ~ ANK (U ankyri "» -bating 

•ha me protetn encoded by the nucleic acid specifically recognise proteins and/or 

nuc fae ac,d molecules (Michaely, P. and V. Bennett < 1 992) W fc CM U**, 
2:127-129) Based on Northern analysis, a major band of 2.4kb and a minor band of8kb 
correspond,^ ,„ TH was expressed in al, human tissue and cell lines tested. However 
the h.ghes, expression occurred in the testis, pancreas, liver, uterus and brain 

hTl-3 ,s a 1048 base pair nucleic acid, the sequence of which is presented in 
Ftgure 3 (SEQ ID NO : 3). The sequence contams TPR repeats (i.e. tetraLpCd 
repeats, and also DNAJ w indicating that the protein encoded by the nuclec acid 
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is involved in protein-protein interactions. Based on Northern analysis, a major band of 
2.2kb and a minor band of 1 .2kb corresponding to hTI-3 was expressed in all human 
tissue and cell lines tested. However, the highest expression occurred in skeletal muscle, 
liver, heart and testis. 

5 mTI-3 is a 1700 base pair nucleic acid, the sequence of which is presented in 

Figure 4 (SEQ ID NO:4). A sequence comparison of a 1035 base region indicates that 
the human and mouse genes are 86.8% identical. Like the human, the murine sequence 
contains TPR repeats (i.e. tetraticopeptide repeats) (Silkorski, R.J. et al M (January 26, 
1 990) Cell 60:307-3 17; Lee, T.G.etal. (April 1994)M>/. Cell. Biol. 14:2331-2342; 

10 Barber, G.N et al. (May 1994) Proc. Nail. Acad ScL USA 91 :4278-4282) and also 
DNAJ repeats (Silver, P. A. (July 16, 1993) Cell 74:5-6), indicating that the protein 
encoded by the nucleic acid is involved in protein-protein interactions. Based on 
Northern analysis, a major band of 1.4kb corresponding to mTl-3 was expressed in all 
murine tissue tested (both tub and B6). However, the highest expression occurred in 

1 5 skeletal muscle, liver, heart and testis. 

hTI-4 is a 1421 base pair nucleic acid, the sequence of which is presented in 
Figure 5 (SEQ ID NO:5). The sequence contains RING finger repeats (Saurin, A.J. et 
al. (June 1996) TIBS 21:) and also Zinc finger repeats (Lovering R. et al. (March 1993) 
Proc. Natl Acad Sci. USA 90:21 12-21 16) indicating that the protein encoded by the 

20 nucleic acid is involved in nucleic acid (i.e. DNA or RNA) interactions. Based on 
Northern analysis, bands of 4 and 3 kb corresponding to hTI-4 was expressed in all 
human tissue and cell lines tested. In addition, al .4 kb band was strongly expressed in 
testis. Further, a band corresponding to 2.4 kb was expressed in the human SHEP, 
SHSY5Y, SKNMC and SKNSH cell lines. 

25 mTI-4 is a 2121 base pair nucleic acid, the sequence of which is presented in 

Figure 6 (SEQ ID NO:6). A sequence comparison of a 959 base region indicates that the 
human and mouse genes arc 86.8% identical. Like the human, the murine sequence 
contains RING finger repeats and also Zinc finger repeats indicating that the protein 
encoded by the nucleic acid is involved in nucleic acid (i.e. DNA or RNA) interactions. 

30 Based on Northern analysis, major bands of 3.0 and 2.4kb corresponding to mTI-4 was 
expressed in all murine tissue tested. In addition, a 1.4 kb band was expressed in Tub 
and B6 mouse. 

Another 77 gene (hTI-5) was identified as encoding human serine 
palmitoyltransferase (GenBank Accession No. U 15555), an enzyme that catalyzes the 
35 committed step in sphingolipid and ceramide biosythesis. Ceramide is a second 
messenger that regulates apoptosis via PP2A (Nickels, J.T. and J.R. Broach (1996) 
Genes & Development 10:382-394. 
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The cDNAs corresponding to 77 gene transcripts were initially cloned from 
human breast tissue based on the ability of their encoded proteins to bind to the C- 
termma. domain (i.e. the .ast 44 amino acids) of the htub gene product in an assay that 
detects protem/protein interactions, placing the TVgene products in the same 
5 biochemical pathway as tub. The tub protein is described in U.S. patent app.ication 
serial number 08/63 1 ,200 filed on April 1 2, 1 996 

Accordingly, certain aspects of the present invention relate to nucleic acid 
molecules encoding 77 proteins, the TVproteins, antibodies immunorcactive with 77 
proteins and preparations of such compositions. In addition, drug discovery assays are 
provided for identifying agents which can modulate the biological function of Tl 
proteins, such as by altering the interaction of 77 molecules with either downstream or 
upstream elements in the tub signal transduction pathway. Such agents can be useful 
therapeutically, for example, to modulate weight control and/or diabetes. Moreover the 
present invention provides diagnostic and therapeutic assays and reagents for detecting 
and treating disorders involving, for example, aberrant expression (or loss thereof) of Tl 
genes. Other aspects of the invention are described below or will be apparent to those 
skilled m the art m light of the present disclosure. 

For convenience, the meaning of certain terms and phrases employed in the 
spec.ficahon, examples, and appended claims are provided below 

"Cells," "host cells" or "recombinant host cells" arc terms used interchangeably 
herein. It ,s understood that such terms refer not only to the particular subject cel. but to 
the progeny or potential progeny of such a cell. Because certain modifications may 
occur ,n succeeding generations due to either mutation or environmental influences 

25 T c Pr08en r ay n0t ' " ^ * ^ 10 ^ ^ Cdl ' but arc sti » -eluded within 
25 the scope of the term as used herein. 

A "chimeric protein" or "fusion protein" is a fusion of a first amino acid 
sequence encoding one of the subject Tl polypeptides with a second amino acid 
sequence defining a domain (e.g. polypeptide portion) foreign to and not substantially 
homologous with any domain of one of the Tl polypeptides. A chimeric protein may 
present a foreign domain which is found (albeit in a different protein) in an organism 
which also expresses the first protein, or it may be an "interspecies", "intcrgenic" etc 
fusion of protein structures expressed by different kinds of organisms. In general a ' 
fus>on protein can be represented by the general formula X-77-Y, wherein 77 represents 
a portion of the protein which is derived from one of the Tl proteins, and X and Y are 
,ndependent.y absent or represent amino acid sequences which are not related to one of 
the 77am,no acid sequences in an organism, including naturally occurring mutants 
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"Complementary" sequences as used herein refer to sequences which have 
sufficient complementarity to be able to hybridize, forming a stable duplex. 

A "delivery complex" shall mean a targeting means (e.g. a molecule that results 
in higher affinity binding of a gene, protein, polypeptide or peptide to a target cell 
5 surface and/or increased cellular uptake by a target cell). Examples of targeting means 
include: sterols (e.g. cholesterol), lipids (e.g. a cationic lipid, virosome or liposome), 
viruses (e.g. adenovirus, adeno-associated virus, and retrovirus) or target cell specific 
binding agents (e.g. ligands recognized by target cell specific receptors). Preferred 
complexes are sufficiently stable in vivo to prevent significant uncoupling prior to 
10 internalization by the target cell. However, the complex is cleavable under appropriate 
conditions within the cell so that the gene,' protein, polypeptide or peptide is released in 
a functional form. 

As is well known, genes for a particular polypeptide may exist in single or 
multiple copies within the genome of an individual. Such duplicate genes may be 

1 5 identical or may have certain modifications, including nucleotide substitutions, additions 
or deletions, which all still code for polypeptides having substantially the same activity. 
The term "DNA sequence encoding a 77 polypeptide" may thus refer to one or more 
genes within a particular individual. Moreover, certain differences in nucleotide 
sequences may exist between individual organisms, which are called alleles. Such allelic 

20 differences may or may not result in differences in amino acid sequence of the encoded 
polypeptide yet still encode a protein with the same biological activity. 

As used herein, the term "gene" or "recombinant gene" refers to a nucleic acid 
molecule comprising an open reading frame encoding one of the 77 polypeptides of the 
present invention, including both exon and (optionally) intron sequences. A 

25 "recombinant gene" refers to nucleic acid molecule encoding a 77 polypeptide and 

comprising 77 protein-encoding exon sequences, though it may optionally include intron 
sequences which are either derived from a chromosomal 77 gene or from an unrelated 
chromosomal gene. Exemplary recombinant genes encoding the subject 77 polypeptides 
are represented in the appended Sequence Listing. The term "intron" refers to a DNA 

30 sequence present in a given gene which is not translated into protein and is generally 
found between exons. 

"Homology" or "identity" or "similarity" refers to sequence similarity between 
two peptides or between two nucleic acid molecules. Homology can be determined by 
comparing a position in each sequence which may be aligned for purposes of 

35 comparison. When a position in the compared sequence is occupied by the same base or 
amino acid, then the molecules are homologous at that position. A degree of homology 
between sequences is a function of the number of matching or homologous positions 
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shared by the sequences. An "unrelated" or "non-homologous" sequence shares less 
than 40% identity, though preferably less than 25% identity, with one of the 77 
sequences of the present invention. 

The term "interact" as used herein is meant to include detectable interactions 
> between molecules, such as can be detected using, for example, a yeast two hybrid 
assay. The term interact is also meant to include "binding" interactions between 
molecules. Interactions may be protein-protein or protein-nucleic acid in nature 

The term "isolated" as used herein with respect to nucleic acids, such as DNA or 
RNA, refers to molecules separated from other DNAs or RNAs, respectively that are 
' present ,n the natural source of the macromolecule. For example, an isolated nucleic 
and encoding one of the subject 77 polypeptides preferably includes no m ore than 10 
k,lobascs (kb) of nucleic acid sequence which naturally immediately flanks the 77 gene 
m genom.c DNA, more preferably no more than 5kb of such naturally occurring 
flanking sequences, and most preferably less than 1 .5kb of such naturally occurring 
flanking sequence. The term isolated as used herein also refers to a nucleic acid or 
pept.de that is substantially free of cellular material, viral material, or culture medium 
when produced by recombinant DNA techniques, or chemical precursors or other 
chemicals when chemically synthesized. Moreover, an "isolated nucleic acid" is meant 
to mclude nucleic acid fragments which are not naturally occurring as fragments and 
would not be found in the natural state. The term "isolated" is also used herein to refer 
to polypept.des which are isolated from other cellular proteins and is meant to 
encompass both purified and recombinant polypeptides. 

The term "modulation" as used herein refers to both upregulation, i.e., 
stimulation, and downregulation, i.e. suppression, of a response. 

The "non-human animals" of the invention include mammalians such as rodents 
non-human primates, sheep, dog, cow, chickens, amphibians, reptiles, etc Preferred 
non-human animals are selected from the rodent family including rat and mouse most 
preferably mouse. The term "chimeric animal" is used herein to refer to animals in 
which the recombinant gene is found, or in which the recombinant is expressed in some 
but not all cells of the animal. The term "tissue-specific chimeric animal" indicates that 
one of the recombinant 77genes is present and/or expressed or disrupted in some tissues 
but not others. 

As used herein, the term "nucleic acid" refers to polynucleotides such as 
deoxynbonucleic acid (DNA), and, where appropriate, ribonucle.c acid (RNA) The 
term should also be understood to include, as equivalents, analogs of either RNA or 
DNA made from nucleotide analogs, and. as applicable to the embodiment being 
described, single (sense or antisense) and double-stranded pel vnucleotides 
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As used herein, the term "promoter" means a DNA sequence that regulates 
expression of a selected DNA sequence operably linked to the promoter, and which 
effects expression of the selected DNA sequence in cells. The term encompasses "tissue 
specific" promoters, i.e. promoters, which effect expression of the selected DNA 
5 sequence only in specific cells (e.g. cells of a specific tissue). The term also covers so- 
called "leaky" promoters, which regulate expression of a selected DNA primarily in one 
tissue, but cause expression in other tissues as well. The term also encompasses non- 
tissue specific promoters and promoters that constitutively express or that arc inducible 
(i.e. expression levels can be controlled). 

10 The terms "protein", "polypeptide" and "peptide" are used interchangably herein 

when referring to a gene product. 

The term "recombinant protein" refers to a polypeptide of the present invention 
which is produced by recombinant DNA techniques, wherein generally, DNA encoding 
a 77 polypeptide is inserted into a suitable expression vector which is in turn used to 

15 transform a host cell to produce the heterologous protein. Moreover, the phrase "derived 
from", with respect to a recombinant 77 gene, is meant to include within the meaning of 
"recombinant protein" those proteins having an amino acid sequence of a native 77 
protein, or an amino acid sequence similar thereto which is generated by mutations 
including substitutions and deletions (including truncation) of a naturally occurring form 

20 of the protein. 

As used herein, the term "specifically hybridizes" or "specifically detects" refers 
to the ability of a nucleic acid molecule of the invention to hybridize to at least 
approximately 6, 12, 20, 30, 50, 100, 1 50, 200, 300, 350, 400 or 425 consecutive 
nucleotides of a vertebrate, preferably mammalian, 77 gene, such as the 77 sequence 

25 designated in one of SEQ ID NOs: 1 -6, or a sequence complementary thereto, or 
naturally occurring mutants thereof, such that it shows more than 10 times more 
hybridization, preferably more than 100 times more hybridization, and even more 
preferably more than 100 times more hybridization than it does to a cellular nucleic acid 
(e.g., mRNA or genomic DNA) encoding a protein other than a vertebrate, preferably 

30 mammalian, Tl protein as defined herein. 

"Transcriptional regulatory sequence" is a generic term used throughout the 
specification to refer to DNA sequences, such as initiation signals, enhancers, and 
promoters, which induce or control transcription of protein coding sequences with which 
they are operably linked. In preferred embodiments, transcription of one of the 

35 recombinant 77 genes is under the control of a promoter sequence (or other 

transcriptional regulatory sequence) which controls the expression of the recombinant 
gene in a cell-type in which expression is intended. It will also be understood that the 
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recombinant gene can be under the control of transcriptional regulatory sequences which 
are the same or which are different from those sequences which control transcription of 
the naturally-occurring forms of 77 proteins. 

As used herein, the term "transfection" means the introduction of a nucleic acid, 
5 e.g., an expression vector, into a recipient cell by nucleic acid-mediated gene transfer. ' 
"Transformation", as used herein, refers to a process in which a cell's genotype is 
changed as a result of the cellular uptake of exogenous DNA or RNA, and, for example, 
the transformed cell expresses a recombinant form of a 77 polypeptide or, in the case of 
anti-sense expression from the transferred gene, the expression of a naturally-occurring 
1 0 form of the 77 protein is disrupted. 

As used herein, the term "transgene" means a nucleic acid sequence encoding, 
e.g., one of the 77 polypeptides, or an antisense transcript thereto, which is partly or 
entirely heterologous, i.e., foreign, to the transgenic animal or cell into which it is 
introduced, or, is homologous to an endogenous gene of the transgenic animal or cell 
1 5 into which it is introduced, but which is designed to be inserted, or is inserted, into the 
animal's genome in such a way as to alter the genome of the cell into which it is inserted 
(e.g., it is inserted at a location which differs from that of the natural gene or its insertion 
results in a knockout). A transgene can include one or more transcriptional regulatory 
sequences and any other nucJeic acid, (e.g. as intron), that may be necessary for optimal 
20 expression of a selected nucJeic acid. 

A "transgenic animal" refers to any animal, preferably a non-human mammal, 
bird or an amphibian, in which one or more of the cells of the animal contain 
heterologous nucleic acid introduced by way of human intervention, such as by 
transgenic techniques well known in the art. The nucleic acid is introduced into the cell, 
25 directly or indirectly by introduction into a precursor of the cell, by way of deliberate 
genetic manipulation, such as by microinjection or by infection with a recombinant 
virus. The term genetic manipulation does not include classical cross-breeding, or in 
vitro fertilization, but rather is directed to the introduction of a recombinant DNA 
molecule. This molecule may be integrated within a chromosome, or it may be 
30 extrachrornosomally replicating DNA. In the typical transgenic animals described 
herein, the transgene causes cells to express a recombinant form of one of the 77 
proteins, e.g. either agonistic or antagonistic forms. However, transgenic animals in 
which the recombinant 77 gene is silent are also contemplated, as for example, the FLP 
or CRE recombinase dependent constructs described below. Moreover, "transgenic 
animal" also includes those recombinant animals in which gene disruption of one or 
more 77 genes is caused by human intervention, including both recombination and 
antisense techniques. 
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As used herein, the term "vector" refers to a nucleic acid molecule capable of 
transporting another nucleic acid to which it has been linked. One type of preferred 
vector is an episome, i.e., a nucleic acid capable of extra-chromosomal replication. 
Preferred vectors are those capable of autonomous replication and/expression of nucleic 
5 acids to which they are linked. Vectors capable of directing the expression of genes to 
which they are operatively linked are referred to herein as "expression vectors". In 
general, expression vectors of utility in recombinant DNA techniques are often in the 
form of "plasmids" which refer generally to circular double stranded DNA loops which, 
in their vector form are not bound to the chromosome. In the present specification, 
10 "plasmid" and "vector" are used interchangeably as the plasmid is the most commonly 
used form of vector. However, the invention is intended to include such other forms of 
expression vectors which serve equivalent functions and which become known in the art 
subsequently hereto. 

15 Nucleic Acids 

As described below, one aspect of the invention pertains to isolated nucleic acids 
comprising nucleotide sequences encoding Tub interactor or 77 polypeptides, and/or 
equivalents of such nucleic acids. The term equivalent is understood to include 
nucleotide sequences encoding functionally equivalent 77 polypeptides or functionally 

20 equivalent peptides having an activity of a 77 protein such as described herein. 

Equivalent nucleotide sequences will include sequences that differ by one or more 
nucleotide substitution, addition or deletion, such as allelic variants; and will, therefore, 
include sequences that differ from the nucleotide sequence of the 77 gene shown in SEQ 
ID NOs:l-6 due to the degeneracy of the genetic code. 

25 Preferred nucleic acids are vertebrate 77 nucleic acids. Particularly preferred 

vertebrate 77 nucleic acids are mammalian. Regardless of species, particularly preferred 
77 nucleic acids encode polypeptides that are at least 80% similar to an amino acid 
sequence of a vertebrate 77 protein. In one embodiment, the nucleic acid is a cDNA 
encoding a polypeptide having at least one bioactivity of the subject 77 polypeptide. 

30 Preferably, the nucleic acid includes all or a portion of the nucleotide sequence 
corresponding to the nucleic acid of SEQ ID NOs: 1,3,5, 7, or 9. 

Still other preferred nucleic acids of the present invention encode a 77 
polypeptide which is comprised of at least 2, 5, 1 0, 25, 50, 1 00. 1 50 or 200 amino acid 
residues. For example, preferred nucleic acid molecules for use as probes/primer or 

35 antisense molecules (i.e. noncoding nucleic acid molecules) can comprise at least about 
6, 12, 20, 30, 50, 100, 125, 150 or 200 base pairs in length, whereas coding nucleic acid 
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molecules can comprise about 300, 400, 500, 600, 700, 800, 900, 950 975 1000 1005 
1010 or 1015 base pairs. ' 

Another aspect of the invention provides a nucleic acid which hybridizes under 
stringent conditions to a nucleic acid represented by one of SEQ ID NOs- 1-6 
5 Appropriate stringency conditions which promote DNA hybridization, for example 6 0 
x sodium chloride/sodium citrate (SSC) at about 45°C, followed by a wash of 2 0 x SSC 
at 50°C are known ,o those skilled in the art or ca* be found in Current Protocols in 
Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. For example the salt 
1 0 ™ ati °" in h thC W3sh Ste P Can * —ted from a low stringency of about 2.0 x SSC 
at 50 C to a high stringency of about 0.2 x SSC at 50°C. In addition, the temperature in 
the wash step can be increased from low stringency conditions at room temperature 
about 22°C, to high stringency conditions at about 65°C. Both temperature and salt may 
be varied, or temperature of salt concentration may be held constant while the other 
variable ,s changed. In a preferred embodiment, a 77 nucleic acid of the present 
invent™ will bind to one of SEQ ID NOs: 1 -6 under moderately stringent conditions 
for example at about 2.0 x SSC and about 4 0 o C . In a particularly preferred 
embodiment, a 77 nucleic acid of the present invention will bind to one of SEQ ID 
NOs: I -6 under high stringency conditions. 

Preferred nucleic acids have a sequence at least 75% homologous and more 
preferably 80% and even more preferably at least 85% homologous with an amino acid 
sequence of a 7V gene, e.g., such as a sequence shown in one of SEQ ID NOs l 6 
Nucleic acids at least 90%, more preferably 95%, and most preferab.y at least about 98- 
99/o homologous with a nucleic sequence represented in one of SEQ ID NOs- 1 -6 are of 
course also within the scope of the invention. In preferred embodiments, the nucleic 
acid ,s mammalian and in particularly preferred embodiments, includes all or a portion 
of the nucleotide sequence corresponding to the coding region of one of SEQ ID 
NOs: 1-6. 

Nucleic acids having a sequence that differs from the nucleotide sequences 
shown in one of SEQ ID NOs: 1 -6 due to degeneracy in the genetic code are a.so within 
he scope of the invention. Such nucleic acids encode functionally equivalent peptides 
(i.c. a peptide having a biological activity of a 77 polypeptide) but differ in sequence 
from the sequence shown in the sequence listing due to degeneracy in the genetic code 
For example, a number of amino acids are designated by more than one triplet Codons 
that specify the same amino acid, or synonyms (for example, CAU and CAC each 
encode histidine) may result in "silent" mutations which do not affect the amino acid 
sequence of a 77 polypeptide. However, it is expected that DNA sequence 
polymorphisms that do lead to changes in the amino acid sequences of the subject 77 
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polypeptides will exist among mammalians. One skilled in the art will appreciate that 
these variations in one or more nucleotides (e.g., up to about 3-5% of the nucleotides) of 
the nucleic acids encoding polypeptides having an activity of a 77 polypeptide may exist 
among individuals of a given species due to natural allelic variation. 
5 As indicated by the examples set out below, 77 protein-encoding nucleic acids 

can be obtained from mRNA present in any of a number of eukaryotic cells. It should 
also be possible to obtain nucleic acids encoding 77 polypeptides of the present 
invention from genomic DNA from both adults and embryos. For example, a gene 
encoding a 77 protein can be cloned from either a cDNA or a genomic library in 

10 accordance with protocols described herein, as well as those generally known to persons 
skilled in the art. Examples of tissues and/or libraries suitable for isolation of the subject 
nucleic acids include breast, among others. A cDN A encoding a 77 protein can be 
obtained by isolating total mRNA from a cell, e.g. a vertebrate cell, a mammalian cell, 
or a human cell, including embryonic cells. Double stranded cDNAs can then be 

1 5 prepared from the total mRNA, and subsequently inserted into a suitable plasmid or 
bacteriophage vector using any one of a number of known techniques. The gene 
encoding a TI protein can also be cloned using established polymerase chain reaction 
techniques in accordance with the nucleotide sequence information provided by the 
invention. The nucleic acid of the invention can be DNA or RNA or analogs thereof. A 

20 preferred nucleic acid is a cDNA represented by a sequence selected from the group 
consisting of SEQ ID NOs:l-6. 

Vectors. 

This invention also provides expression vectors containing a nucleic acid 
25 encoding a 77 polypeptide, operably linked to at least one transcriptional regulatory 

sequence. "Operably linked" is intended to mean that the nucleotide sequence is linked 
to a regulatory sequence in a manner which allows expression of the nucleotide 
sequence. Regulatory sequences are art-recognized and arc selected to direct expression 
of the subject TI proteins. Accordingly, the term "transcriptional regulatory sequence" 
30 includes promoters, enhancers and other expression control elements. Such regulatory 
sequences are described in Goeddel; Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, CA (1990). In one embodiment, the 
expression vector includes a recombinant gene encoding a peptide having an agonistic 
activity of a subject TI polypeptide, or alternatively, encoding a peptide which is an 
35 antagonistic form of the 77 protein. Such expression vectors can be used to transfect 
cells and thereby produce polypeptides, including fusion proteins, encoded by nucleic 
acids as described herein. Moreover, the gene constructs of the present invention can 
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also be used as a part of a gene therapy protocol to deliver nucleic acids encoding either 
an agon.stic or antagonistic form of one of the subject 77 proteins. Thus, another aspect 
of the invention features expression vectors for in vivo or in vitro transfection and 
express^ of a 77 polypeptide in particular cell types so as to reconstitute the function 
of, or alternatively, abrogate the function of 77-induced signaling in a tissue. This could 
be desirable, for example, when the naturally-occurring form of the protein is 
m.sexpressed; or to deliver a form of the protein which alters differentiation of tissue 
Express.on vectors may also be employed to inhibit neoplastic transformation 

In addition to viral transfer methods, such as those illustrated above non-viral 
methods can also be employed to cause expression of a subject 77 polypeptide in the 
tissue of an animal. Most nonviral methods of gene transfer rely on normal mechanisms 
used by mammalian cells for the uptake and intracellular transport of macromolccules 
In preferred embodiments, non-viral targeting means of the present invention rely on' ' 
endocync pathways for the uptake of the subject 77 polypeptide gene by the targeted 
cell. Exemplary targeting means of this type include liposomal derived systems poly- 
lysine conjugates, and artificial viral envelopes. 

Probes and Primprg 

Moreover, the nucleotide sequences determined from the cloning of 77 genes 
from mammalian organisms will further allow for the generation of probes and primers 
des,gned for use in identifying and/or cloning Tl homologues in other cell types e g 
from other tissues, as well as Tl homologues from other mammalian organisms ' For 
instance, the present invention also provides a probe/primer comprising a substantially 
punfied oligonucleotide, which oligonucleotide comprises a region of nucleotide 
sequence that hybridizes under stringent conditions to at least approximately 12 
preferably 25, more preferably 40, 50 or 75 consecutive nucleotides of sense or anti- 
sense sequence selected from the group consisting of SEQ ID NOs: 1 -6 or naturally 
occurring mutants thereof. For instance, primers based on the nucleic acid represented 
.n SEQ ID NOs:l , 3, 5, 7 or 9 can be used in PGR reactions to clone Tl homologues 
Preferred primers for hTI-4 are set forth as SEQ ID NOs:9 and 10. Preferred primers for 

h SEomNn 0 t ,D N ° S: ' 3 ^ H - Preferred primers for hTI - 3 - - «** 

•n SEQ ID NOs: 1 7 and 1 8. Preferred primers for hTI-1 are set forth in SEQ ID NOs-? I 
and 22. Preferred primers for mTI-4 are set forth in SEQ ID NOs:25 and 26 Preferred 
primers for hTI-2 are set forth in SEQ ID NOs:29 and 30. 

Likewise, probes based on the subject 77sequences can be used to detect 
transenpts or genomic sequences encoding the same or homologous proteins In 
preferred embodiments, the probe further comprises a label group attached thereto and 
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able to be detected, e.g. the label group is a radioisotope, a fluorescent compound, an 
enzyme, or an enzyme co-factor. 

As discussed in more detail below, such probes can also be used as a part of a 
diagnostic test kit for identifying cells or tissue which misexpress a 77 protein, such as 
5 by measuring a level of a 77-encoding nucleic acid in a sample of cells from a patient; 
e.g. detecting 77 mRNA levels or determining whether a genomic 77 gene has been 
mutated or deleted. Briefly, nucleotide probes can be generated from the subject 77 
genes which facilitate histological screening of intact tissue and tissue samples for the 
presence (or absence) of 77-encoding transcripts. Similar to the diagnostic uses of anti- 

1 0 77 antibodies, the use of probes directed to 77 messages, or to genomic 77 sequences, can 
be used for both predictive and therapeutic evaluation of allelic mutations which might 
be manifest in, for example, a predisposition to diabetes. Used in conjunction with 
immunoassays as described herein, the oligonucleotide probes can help facilitate the 
determination of the molecular basis for a disorder which may involve some abnormality 

15 associated with expression (or lack thereof) of a TI protein. For instance, variation in 
polypeptide synthesis can be differentiated from a mutation in a coding sequence. 

Antisense, Ribozvme and Triplex techniques 

Another aspect of the invention relates to the use of the isolated nucleic acid in 

20 "antisense" therapy. As used herein, "antisense" therapy refers to administration or in 
situ generation of oligonucleotide molecules or their derivatives which specifically 
hybridize (e.g. bind) under cellular conditions, with the cellular mRNA and/or genomic 
DNA encoding one or more of the subject 77 proteins so as to inhibit expression of that 
protein, e.g. by inhibiting transcription and/or translation. The binding may be by 

25 conventional base pair complementarity, or, for example, in the case of binding to DNA 
duplexes, through specific interactions in the major groove of the double helix. In 
general, "antisense" therapy refers to the range of techniques generally employed in the 
art, and includes any therapy which relies on specific binding to oligonucleotide 
sequences. 

30 An antisense construct of the present invention can be delivered, for example, as 

an expression plasmid which, when transcribed in the cell, produces RNA which is 
complementary to at least a unique portion of the cellular mRNA which encodes a 77 
protein. Alternatively, the antisense construct is an oligonucleotide probe which is 
generated ex vivo and which, when introduced into the cell causes inhibition of 

35 expression by hybridizing with the mRNA and/or genomic sequences of a TI gene. 
Such oligonucleotide probes are preferably modified oligonucleotides which are 
resistant to endogenous nucleases, e.g. exonucleases and/or endonucleases, and are 
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therefore stable in vivo. Exemplary nucleic acid molecules for use as antisense 
oligonucleotides are phosphoramidatc, phosphothioate and methylphosphonate analogs 
of DNA (see also U.S. Patents 5,176,996; 5,264,564; and 5,256,775). Additionally, 
general approaches to constructing oligomers useful in antisense therapy have been 
5 reviewed, for example, by Van der Krol et al. ( 1 988) Biotechniques 6:958-976; and 
Stein et al. (1 988) Cancer Res 48:2659-2668. With respect to antisense DNA, 
oligodeoxyribonucleotides derived from the translation initiation site, e.g., between the 
- 1 0 and + 1 0 regions of the 77 nucleotide sequence of interest, are preferred. Particularly 
preferred antisense molecules are set forth in SEQ ID NOs:l 1, 15, 19, 23 and 27 
1 0 Antisense approaches involve the design of oligonucleotides (either DNA or 

RNA) that are complementary to 77 mRNA. The antisense oligonucleotides will bind to 
the 77 mRNA transcripts and prevent translation. Absolute complementarity, although 
preferred, is not required. A sequence "complementary" to a portion of an RNA, as 
referred to herein, means a sequence having sufficient complementarity to be able to 
1 5 hybridize with the RNA, forming a stable duplex; in the case of double-stranded 

antisense nucleic acids, a single strand of the duplex DNA may thus be tested, or triplex 
formation may be assayed. The ability to hybridize will depend on both the degree of 
complementarity and the length of the antisense nucleic acid. Generally, the longer the 
hybridizing nucleic acid, the more base mismatches with an RNA it may contain and 
20 still form a stable duplex (or triplex, as the case may be). One skilled in the art can 

ascertain a tolerable degree of mismatch by use of standard procedures to determine the 
melting point of the hybridized complex. 

Oligonucleotides that are complementary to the 5" end of the message, e.g., the 5' 
untranslated sequence up to and including the AUG initiation codon, should work most 
25 efficiently at inhibiting translation. However, sequences complementary to the 3* 

untranslated sequences of mRNAs have recently been shown to be effective at inhibiting 
translation of mRNAs as well. (Wagner, R. (1 994) Nature 372:333). Therefore, 
oligonucleotides complementary to either the 5' or 3' untranslated, non-coding regions of 
a 77 gene could be used in an antisense approach to inhibit translation of endogenous 77 
mRNA. Oligonucleotides complementary to the 5' untranslated region of the mRNA 
should include the complement of the AUG start codon. Antisense oligonucleotides 
complementary to mRNA coding regions are less efficient inhibitors of translation but 
could be used in accordance with the invention. Whether designed to hybridize to the 5', 
3* or coding region of 77 mRNA, antisense nucleic acids should be at least six 
nucleotides in length, and are preferably less that about 100 and more preferably less 
than about 50, 25, 1 7 or 10 nucleotides in length. 
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Regardless of the choice of target sequence, it is preferred that in vitro studies are 
first performed to quantitate the ability of the antisense oligonucleotide to quantitate the 
ability of the antisense oligonucleotide to inhibit gene expression. It is preferred that 
these studies utilize controls that distinguish between antisense gene inhibition and 
5 nonspecific biological effects of oligonucleotides. It is also preferred that these studies 
compare levels of the target RNA or protein with that of an internal control RNA or 
protein. Additionally, it is envisioned that results obtained using the antisense 
oligonucleotide are compared with those obtained using a control oligonucleotide. It is 
preferred that the control oligonucleotide is of approximately the same length as the test 
1 0 oligonucleotide and that the nucleotide sequence of the oligonucleotide differs from the 
antisense sequence no more than is necessary to prevent specific hybridization to the 
target sequence. 

The oligonucleotides can be DNA or RNA or chimeric mixtures or derivatives or 
modified versions thereof, single-stranded or double-stranded, the oligonucleotide can 

15 be modified at the base moiety, sugar moiety, or phosphate backbone, for example, to 
improve stability of the molecule, hybridization, etc. The oligonucleotide may include 
other appended groups such as peptides (e.g., for targeting host cell receptors in vivo), or 
agents facilitating transport across the cell membrane (see, e.g., Letsinger et al. (1989) 
Proc. Natl Acad. Sci. U.S.A. 86:6553-6556; Lemaitre et al. (1987) Proc. Natl Acad. ScL 

20 USA 84:648-652; PCT Publication No. WO 88/098 1 0, published December 15,1 988) or 
the blood-brain barrier (see, e.g., PCT Publication No. WO 89/10134, published April 
25, 1988), hybridization-triggered cleavage agents. (See, e.g., Krol et al. (1988) 
BioTechniques 6:958-976) or intercalating agents. (See, e.g.. Zon (1988) Pharm. Res. 
5:539-549). To this end, the oligonucleotide may be conjugated to another molecule, 

25 e.g., a peptide, hybridization triggered cross-linking agent, transport agent, 
hybridization-triggered cleavage agent, etc. 

The antisense oligonucleotide may comprise at least one modified base moiety 
which is selected from the group including but not limited to 5-fluorouracil, 5- 
bromouracil, 5-chlorouraciU 5-iodouracil, hypoxanthinc, xanthine, 4-acetylcytosinc, 5- 

30 (carboxyhydroxyethyl) uracil, S-carboxymethylaminomethyl^-thiouridinc, 

5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosinc, inosine, 
N6-isopentenyladenine, 1-methylguanine, 1 -mcthylinosine, 2,2-dimethylguanine, 
2-methyladenine, 2-methylguanine, 3-methylcytosine. 5-methylcytosine, N6-adeninc, 
7-mcthylguanine, 5-methyIaminomethyIuracil, 5-mcthoxyaminonicthyl-2-thiouracil, 

35 beta-D-mannosylqueosine, S'-methoxycarboxymethyluracil, 5-methoxyuracil, 
2-methyhhio-N6-isopentenyladeninc, uracil-5-oxyacctic acid (v), wybutoxosine, 
pseudouracil, queosine, 2-thiocytosine, 5-methyI-2-lhiouracih 2-thiouraciL 4-thiouracik 
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5-methyluracil, uraciI-5- oxyacelic acid methylester. uracil-5-oxyacetic acid (v), 
5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 

2,6-diaminopurine. 

The anlisense oligonucleotide may also comprise at least one modified sugar 
moiety selected from the group including but not limited to arabinose, 2- 
fluoroarabinose, xylulose, and hexose. 

In yet another embodiment, the antisense oligonucleotide comprises at least one 
mod.fied phosphate backbone selected from the group consisting of a phosphorothioate 
a phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a 

phosphordiamidate, a methylphosphonate, an alkyl phosphotriester, and a formacetal or 
analog thereof. 

In yet a further embodiment, the antisense oligonucleotide is an a-anomeric 
ohgonucleotidc. An a-anomeric oligonucleotide forms specific double-stranded hybrids 
with complementary RNA in which, contrary to the usual P-units, the strands run 
parallel to each other (Gautier et al. (1987) Nucl. Acids Res. 15:6625-6641 ). The 
oligonucleotide is a r-O-methylribonucleotide (Inoue et al. (1987) Nucl. Acids Res. 
15:6131-6148), or a chimeric RNA-DNA analogue (Inoue et al. (1987) FEBS Lett 
215:327-330). 

Oligonucleotides of the invention may be synthesized by standard methods 
known in the art, e.g. by use of an automated DNA synthesizer (such as are 
commercially available from Biosearch, Applied Biosystems, etc.). As examples, 
phosphorothioate oligonucleotides may be synthesized by the method of Stein et al 
(1988) Nucl. Acids Res. 16:3209). methylphosphonate oligonucleotides can be prepared 
by use of controlled pore glass polymer supports (Sarin et al. (1988) Proc Natl. Acad. 
25 Sci. U.S. A. 85:7448-7451), etc. 

While antisense nucleotides complementary to the Tl coding region sequence can 
be used, those complementary to the transcribed untranslated region arc most preferred. 
For example, an antisense oligonucleotide as set forth in SEQ ID NOs: 1 1, 15, 19, 23 and 
27 can be utilized in accordance with the invention. 

The antisense molecules should be delivered to cells which express Tl in vivo A 
number of methods have been developed for delivering antisense DNA or RNA to cells- 
e.g., antisense molecules can be injected directly into the tissue site, or modified 
anusense molecules, designed to target the desired cells (e.g., antisense linked to 
pepndes or antibodies that specifically bind receptors or antigens expressed on the target 
35 cell surface) can be administered systematically. 

However, it is often difficult to achieve intracellular concentrations of the 
anhsense sufficient to suppress translation on endogenous mRNAs. Therefore a 
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preferred approach utilizes a recombinant DNA construct in which the antisense 
oligonucleotide is placed under the control of a strong pol III or pol II promoter. The 
use of such a construct to transfect target cells in the patient will result in the 
transcription of sufficient amounts of single stranded RNAs that will form 
5 complementary base pairs with the endogenous 77 transcripts and thereby prevent 

translation of the 77 mRNA. For example, a vector can be introduced in vivo such that it 
is taken up by a cell and directs the transcription of an antisense RNA. Such a vector 
can remain episomal or become chromosomally integrated, as long as it can be 
transcribed to produce the desired antisense RNA. Such vectors can be constructed by 

1 0 recombinant DNA technology methods standard in the art. Vectors can be plasmid, 

viral, or others known in the art, used for replication and expression in mammalian cells. 
Expression of the sequence encoding the antisense RNA can be by any promoter known 
in the art to act in mammalian, preferably human cells. Such promoters can be inducible 
or constitutive. Such promoters include but are not limited to: the SV40 early promoter 

15 region (Bernoist and Chambon (1 981) Nature 290:304-310), the promoter contained in 
the 3* long terminal repeat of Rous sarcoma virus (Yamamoto et al. (1980) Cell 22:787- 
797), the herpes thymidine kinase promoter (Wagner et al. (1981) Proa Natl. Acad. Sci 
U.S.A. 78:1441-1445), the regulatory sequences of the metallothionein gene (Brinster et 
al. (1982) Nature 296:39-42), etc. Any type of plasmid, cosmid, YAC or viral vector 

20 can be used to prepare the recombinant DNA construct which can be introduced directly 
into the tissue site; e.g.? the choroid plexus or hypothalamus. Alternatively, viral vectors 
can be used which selectively infect the desired tissue; (e.g., for brain, herpesvirus 
vectors may be used), in which case administration may be accomplished by another 
route (e.g., systematically). 

25 Ribozyme molecules designed to catalytically cleave 77 mRNA transcripts can 

also be used to prevent translation of 77 mRNA and expression of 77 (See, e.g., PCT 
Publication No. WO 90/1 1364, published October 4, 1990; Sarver et al. (1990) Science 
247:1222-1225 and U.S. Patent No. 5,093,246). While ribozymes that cleave mRNA at 
site specific recognition sequences can be used to destroy 77 mRNAs, the use of 

30 hammerhead ribozymes is preferred. Hammerhead ribozymes cleave mRNAs at 
locations dictated by flanking regions that form complementary base pairs with the 
target mRNA. The sole requirement is that the target mRNA have the following 
sequence of two bases: 5'-UG-3'. The construction and production of hammerhead 
ribozymes is well known in the art and is described more fully in HaselofT and Gerlach 

35 (1988) Nature 334:585-591 . There are hundreds of potential hammerhead ribozyme 
cleavage sites within the nucleotide sequence of human 77cDNA (Fig. 1). Preferably 
the ribozyme is engineered so that the cleavage recognition site is located near the 5' end 
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of the 77 mRNA; i.e., to increase efficiency and minimize the intracellular accumulation 
of non-functional mRNA transcripts. 

The ribozymes of the present invention also include RNA endoribonucleases 
(heremafter "Cech-type ribozymes") such as the one which occurs naturally in 
5 Tetrahymena thermophila (known as the I VS. or L-19 IVS RNA) and which has been 

,T,l V t fi d T ibed ^ ^ ^ ^ C ° 1,ab0rat0rS « •»■ 0 »♦> ^ience 

324.429-433; pubhshed PCT Publication No. WO 88/04300 by University Patents Inc • 
Been and Cech, (1 986) Cell 47:207-2,6). The Cech-type ribozymes have an eight base' 
pa,r act.ve site which hybridizes to a target RNA sequence whereafter cleavage of the 
target RNA takes place. The invention encompasses those Cech-type ribozymes which 
target e.ght base-pair active site sequences that are present in a TV gene. Particularly 
preferred ribozymes are set forth in SEQIDNOs:8, 12, 16, 20,24and28 

As in the antisense approach, the ribozymes can be composed of modified 
oligonucleotides (e.g. for improved stability, targeting, etc.) and should be delivered to 
ceMs which express the 77 gene /„ vivo. A preferred method of delivery involves using a 
— " e « codi "g" «* ribozyme under the control of a strong constitutive pol II, 
or pol II promoter, so that transfected cells will produce sufficient quantities of the 
nbozyme to destroy endogenous 77 messages and inhibit translation Because 
nbozymes unlike antisense molecules, are catalytic, a lower intracellular concentration 
is required for efficiency. 

Endogenous TI gene expression can also be reduced by inactivating or "knocking 
out then gene or ,ts promoter using targeted homologous recombination, (see eg. 
Sm.thies et al. (1985) Nature 3 1 7:230-234; Thomas and Capecchi (1 987) Cell 5 , 503- 
512; Thompson et al. (1989) Cell 5:313-32. ; each of which is incorporated by reference 
heretn ,n ,ts entirety). For example, a mutant, non-functional 77 (or a completely 
unrelated DNA sequence) flanked by DNA homologous to the endogenous 77 gene 
(ether the coding regions or regulatory regions of the TI gene) can be used, with or 
wthout a selectable marker and/or a negative selectable marker, to transfect cells that 
express 77/* vivo. Insertion of the DNA construct, via targeted homologous 
recombmation, results in inactivation of the 77gene. Such approaches are particularly 
suued m the agricultural field where modifications to ES (embryonic stem) cells can be 
used to generate animal offspring with an inactive TI(e. g ., see Thomas and Capecchi 
1 987. and Thompson, .989, supra). However this approach can be adapted for use in 
humans provided the recombinant DNA constructs are directly administered or targeted 
to the requ.red s„e /„ vivo using appropriate viral vectors, e.g., herpes virus vectors for 
dehvery to brain tissue; e.g., the hypothalamus and/or choroid plexus. 
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Altematively, endogenous 77 gene expression can be reduced by targeting 
deoxyribonucleotide sequences complementary to the regulatory region of the TI gene 
(i.e., the 7/ promoter and/or enhancers) to form triple helical structures that prevent 
transcription of the 77 gene in target cells in the body. (See generally, Helene, C. (1991) 

5 Anticancer Drug Des. 6(6):569-84; Helene, C, et aJ. (1 992) Ann, N. Y. Acad Sci. 
660:27-36; and Maher, LJ. (1992) Bioassays 14(12):807-15). 

Likewise, the antisense constructs of the present invention, by antagonizing the 
normal biological activity of one of the 77 proteins, can be used in the manipulation of 
issue, e.g. lipid metabolism, both in vivo and for ex vivo tissue cultures. 

10 Furthermore, like the antisense techniques (e.g. microinjection of antisense 

molecules, or transfection with plasmids whose transcripts are antisense with regard to a 
77mRNA or gene sequence) antagonizing the normal biological activity of one of the 77 
proteins can be used to investigate role of 77 in lipid metabolism. Such techniques can 
be utilized in cell culture, but can also be used in the creation of transgenic animals, as 

15 detailed below. 

Nucleic acid molecules to be used in triple helix formation for the inhibition of 
transcription are preferably single stranded and composed of deoxyribonucleotides. The 
base composition of these oligonucleotides should promote triple helix formation via 
Hoogsteen base pairing rules, which generally require sizable stretches of either purines 

20 or pyrimidines to be present on one strand of a duplex. Nucleotide sequences may be 
pyrimidine-based, which will result in TAT and CGC triplets across the three associated 
strands of the resulting triple helix. The pyrimidine-rich molecules provide base 
complementarity to a purine-rich region of a single strand of the duplex in a parallel 
orientation to that strand. In addition, nucleic acid molecules may be chosen that are 

25 purine-rich, for example, containing a stretch of G residues. These molecules will form 
a triple helix with a DNA duplex that is rich in GC pairs, in which the majority of the 
purine residues are located on a single strand of the targeted duplex, resulting in CGC 
triplets across the three strands in the triplex. 

Alternatively, the potential sequences that can be targeted for triple helix 

30 formation may be increased by creating a so called "switchback" nucleic acid molecule. 
Switchback molecules are synthesized in an alternating 5'-3\ 3'-5* manner, such that they 
base pair with first one strand of a duplex and then the other, eliminating the necessity 
for a sizable stretch of either purines or pyrimidines to be present on one strand of a 
duplex. 

35 Antisense RNA and DNA, ribozyme. and triple helix molecules of the invention 

may be prepared by any method known in the art for the synthesis of DNA and RNA 
molecules. These include techniques for chemically synlhcsizing 
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ol.godeoxyribonucleotides and oligoribonucleotides well known in the art such as for 
example solid phase phosphoramide chemical synthesis. Alternatively, RNA molecules 
may be generated by in vitro and in vivo transcription of DNA sequences encoding the 
annscnse RNA molecule. Such DNA sequences may be incorporated into a wide variety 
of vectors which incorporate suitable RNA polymerase promoters such as the T7 or SP6 
polymerase promoters. Alternatively, anti sense cDNA constructs that synthesize 
annsense RNA constitutively or inducibly, depending on the promoter used, can be 
introduced stably into cell lines. 

Moreover, various well-known modifications to nucleic acid molecules may be 
introduced as a means of increasing intracellular stability and half-life Possible 
mod.ficat.ons include but are not limited to the addition of flanking sequences of 
nbonucleot.des or deoxyribonucleotides to the 5' anoVor 3' ends of the molecule or the 
use of phosphorothioate or 2' O-methyl rather than phosphodiesterase linkages within 
the ohgodeoxyribonucleotide backbone. 

Polypeptides of nV ln W m;^ 

The present invention also makes available isolated 77 polypeptides which are 
•solated from, or otherwise substantially free of other cellular proteins, especially other 
s.gnal transduction factors and/or transcription factors which may normally be 
assocated with the 77 polypeptide. The term "substantially free of other cellular 
protems" (also referred to herein as "contaminating proteins") or "substantially pure or 
punfied preparations" are defined as encompassing preparations of 77 polypeptides 
havmg less than about 20% (by dry weight) contaminating protein, and preferably 
havmg less than about 5% contaminating protein. Functional forms of the subject 
polypeptides can be prepared, for the first time, as purified preparations by using a 
cloned gene as described herein. By "purified", i, is meant, when referring to a peptide 
or DNA or RNA sequence, that the indicated molecule is present in the substantial 
absence of other biological macromolecules, such as other proteins. The term "purified" 
as used herein preferably means at least 80% by dry weight, more preferably in the range 
of 9>99/„ by weight, and most preferably at least 99.8% by weight, of biological 
macromolecules of the same type present (but water, buffers, and other small molecules 
espeaally molecules having a molecular weight of less than 5000. can be present) The ' 
term "pure" as used herein preferably has the same numerical limits as "purified" 
.mediately above. "Isolated" and "purified" do not encompass either natural materials 
•n the.r nat.ve state or natural materials that have been separated into components (e g 
m an acrylamide gel) but not obtained either as pure (e.g. lacking contaminating 
protems, or chromatography reagents such as denaturing agents and polymers e g 
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acrylamide or agarose) substances or solutions. In preferred embodiments, purified 77 
preparations will lack any contaminating proteins from the same animal from which 77 
is normally produced, as can be accomplished by recombinant expression of, for 
example, a human 77 protein in a non-human cell. 
5 Full length proteins or fragments corresponding to one or more particular motifs 

and/or domains or to arbitrary sizes, for example, at least 5, 10, 25, 50, 75, 100, 125, 150 
amino acids in length are within the scope of the present invention. 

For example, isolated 77 polypeptides can be encoded by all or a portion of a 
nucleic acid sequence shown in any of SEQ ID NOs: 1 -6. Isolated peptidyl portions of 

1 0 77 proteins can be obtained by screening peptides recombinantly produced from the 
corresponding fragment of the nucleic acid encoding such peptides. In addition, 
fragments can be chemically synthesized using techniques known in the art such as 
conventional Merrifield solid phase f-Moc or t-Boc chemistry. For example, a 77 
polypeptide of the present invention may be arbitrarily divided into fragments of desired 

15 length with no overlap of the fragments, or preferably divided into overlapping 

fragments of a desired length- The fragments can be produced (recombinantly or by 
chemical synthesis) and tested to identify those peptidyl fragments which can function 
as either agonists or antagonists of a wild-type (e.g., "authentic") 77 protein. 

Another aspect of the present invention concerns recombinant forms of the 77 

20 proteins. Recombinant polypeptides preferred by the present invention, in addition to 
native 77 proteins, are encoded by a nucleic acid, which is at least 85% homologous and 
more preferably 90% homologous and most preferably 95 % homologous with a nucleic 
acid sequence represented by SEQ ID NOs: 1 -6. Polypeptides which are encoded by a 
nucleic acid that is at least about 98-99% homologous with the sequence of SEQ ID 

25 NOs: 1-6 are also within the scope of the invention. In a preferred embodiment, a Tl 
protein of the present invention is a mammalian 77 protein. In a particularly preferred 
embodiment a 77 protein is encoded by one of the nucleic acids set forth as SEQ ID 
NOs: 1 -6. In particularly preferred embodiment, a 77 protein has a 77 bioactivity. It will 
be understood that certain post-translational modifications, e.g., phosphorylation and the 

30 like, can increase the apparent molecular weight of the Tl protein relative to the 
unmodified polypeptide chain. 

The present invention further pertains to recombinant forms of one of the subject 
77 polypeptides. Such recombinant Tl polypeptides preferably are capable of 
functioning in one of either role of an agonist or antagonist of at least one biological 

35 activity of a wild-type ("authentic") 77 protein of the appended sequence listing. The 
term "evolutionary related to", with respect to amino acid sequences of 77 proteins, 
refers to both polypeptides having amino acid sequences which have arisen naturally. 
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and also to mutational variants of human 77 polypeptides which are derived, for 

example, by combinatorial mutagenesis. 

In general, polypeptides referred to herein as having an activity (e g are 

"bioactive") of a 77 protein are defined as polypeptides which include an amino acid 

sequence encoded by all or a portion of the nucleic acid sequences shown in one of SEQ 
ID NOs.1-6 and which mimic or antagonize all or a portion of the 
biological/biochemical activities of a naturally occurring 77 protein. In preferred 
embodiments a 77 protein of the present invention specifically interacts with a the 
carboxy terminus (i.e. last 44 amino acids) of the human tub polypeptide. Examples of 
such b.ological activity include the ability to modulate weight control and/or diabetes 
Other b,ological activities of the subject 77 proteins are described herein or will be 
reasonably apparent to those skilled in the art. According to the present invention a 
poly P e P t,dc has biological activity if it is a specific agonist or antagonist of a naturally- 
occurring form of a 77 protein. 

The present invention further pertains to methods of producing the subject 77 
polypeptides. For example, a host cell transfected with a nucleic acid vector directing 
expression of a nucleotide sequence encoding the subject polypeptides can be cultured 
under appropriate conditions to allow expression of the peptide to occur. The cells may 
be harvested, lysed and the protein isolated. A cell culture includes host cells media 
and other byproducts. Suitable media for cell culture are well known in the art The 
recombinant 77 polypeptide can be isolated from cell culture medium, host cells or both 
using techniques known in the art for purifying proteins including ion-exchange 
chromatography, gel filtration chromatography, ultrafiltration, electrophoresis and 
.mmunoaffinity purification with antibodies specific for such peptide. In a preferred 
embodiment, the recombinant 77 polypeptide is a fusion protein containing a domain 
which facilitates its purification, such as GST fusion protein. 

Moreover, it will be generally appreciated that, under certain circumstances it 
may be advantageous to provide homologues of one of the subject TI polypeptides 
which function in a limited capacity as one of either a TI agonist (mimetic) or a TI 
antagonist, in order to promote or inhibit only a subset of the biological activities of the 
naturally-occurring form of the protein. Thus, specific biological effects can be elicited 
by treatment with a homologue of limited function, and with fewer side effects relative 
to treatment with agonists or antagonists which are directed to all of the biological 
activities of naturally occurring forms of 77 proteins. 

Homologues of each of the subject 77 proteins can be generated by mutagenesis 
such as by discrete point mutation(s), or by truncation. For instance, mutation can give' 
rise to homologues which retain substantially the same, or merely a subset of the 
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biological activity of the 77 polypeptide from which it was derived. Alternatively, 
antagonistic forms of the protein can be generated which are able to inhibit the function 
of the naturally occurring form of the protein, such as by competitively binding to a 
downstream or upstream member of the 77 cascade which includes the 77 protein. In 
5 addition, agonistic forms of the protein may be generated which are constitutively 

active. Thus, the 77 protein and homologues thereof provided by the subject invention 
may be either positive or negative regulators of weight control and/or diabetes. 

The recombinant 77 polypeptides of the present invention also include 
homologues of the wild-type 77 proteins, such as versions of those protein which are 

10 resistant to proteolytic cleavage, as for example, due to mutations which alter 
ubiquitination or other enzymatic targeting associated with the protein. 

77 polypeptides may also be chemically modified to create 77 derivatives by 
forming covalent or aggregate conjugates with other chemical moieties, such as glycosyl 
groups, lipids, phosphate, acetyl groups and the like. Covalent derivatives of 77 proteins 

15 can be prepared by linking the chemical moieties to functional groups on amino acid 
sidechains of the protein or at the N-terminus or at the C-tcrminus of the polypeptide. 

Modification of the structure of the subject 77 polypeptides can be for such 
purposes as enhancing therapeutic or prophylactic efficacy, stability (e.g., ex vivo shelf 
life and resistance to proteolytic degradation in vivo), or post-translational modifications 

20 (e.g., to alter phosphorylation pattern of protein). Such modified peptides, when 

designed to retain at least one activity of the naturally-occurring form of the protein, or 
to produce specific antagonists thereof, are considered functional equivalents of the 77 
polypeptides described in more detail herein. Such modified peptides can be produced, 
for instance, by amino acid substitution, deletion, or addition. 

25 For example, it is reasonable to expect that an isolated replacement of a leucine 

with an isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, or a 
similar replacement of an amino acid with a structurally related amino acid (i.e. isosteric 
and/or isoelectric mutations) will not have a major effect on the biological activity of the 
resulting molecule. Conservative replacements are those that take place within a family 

30 of amino acids that are related in their side chains. Genetically encoded amino acids are 
can be divided into four families: (1) acidic = aspartate, glutamate; (2) basic = lysine, 
arginine, histidinc; (3) nonpolar = alanine, valine, leucine, isoleucine, proline, 
phenylalanine, methionine, tryptophan; and (4) uncharged polar = glycine, asparaginc. 
glutaminc, cysteine, serine, threonine, tyrosine. In similar fashion, the amino acid 

35 repertoire can be grouped as (1) acidic = aspartate, glutamate; (2) basic = lysine, arginine 
histidine, (3) aliphatic = glycine, alanine, valine, leucine, isoleucine, serine, threonine, 
with serine and threonine optionally be grouped separately as aliphatic-hydroxyl; (4) 
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aromatic = phenylalanine, tyrosine, tryptophan; (5) amide = asparagine, glutamine; and 
(6) sulfur -containing = cysteine and methionine, (see, for example. Biochemistry 2nd 
ed Ed. by L. Stryer, W.H. Freeman and Co., 1981). Whether a change in the amino 
acid sequence of a peptide results in a functional TYhomologue (e.g. functional in the 
5 sense that the resulting polypeptide mimics or antagonizes the wild-type form) can be 
readily determined by assessing the ability of the variant peptide to produce a response 
m cells in a fashion similar to the wild-type protein, or competitively inhibit such a 
response. Polypeptides in which more than one replacement has taken place can readily 
be tested in the same manner. 

10 This invention further contemplates a method for generating sets of 

combmatorial mutants of the subject 77 proteins as well as truncation mutants and is 
especially useful for identifying potential variant sequences (e.g. homologues) that are 
funct.onal ,n modulating signal transduction from a lipid receptor. The purpose of 
screening such combinatorial libraries is to generate, for example, novel 77 homologues 
which can act as either agonists or antagonist, or alternatively, possess novel activities 
all together. To illustrate, 77 homologues can be engineered by the present method to 
prov,de selective, constitutive activation of a tub signaling pathway Thus 
eombinatorially-derived homologues can be generated to have an increased potency 
relative to a naturally occurring form of the protein. 

Likewise, 77 homologues can be generated by the present combinatorial 
approach to selectively inhibit (antagonize) induction by a lipid. For instance 
mutagenesis can provide 77 homologues which are able to bind other signal pathway 
proteins (or DNA) yet prevent propagation of the signal, e.g. the homologues can be 
dominant negative mutants. Moreover, manipulation of certain domains of TI by the 
present method can provide domains more suitable for use in fusion proteins 

In one embodiment, the variegated library of 77 variants is generated by 
combmatorial mutagenesis at the nucleic acid level, and is encoded by a variegated gene 
.brary. For example, a mixture of synthetic oligonucleotides can be enzymatically 
hgated into gene sequences such that the degenerate set of potential TI sequences are 
express.ble as individual polypeptides, or alternatively, as a set of larger fusion proteins 
(e.g. for phage display) containing the set of 77 sequences therein 

There are many ways by which such libraries of potential 77 homologues can be 
generated from a degenerate oligonucleotide sequence. Chemical synthesis of a 
degenerate gene sequence can be carried out in an automatic DNA synthesizer and the 
synthetic genes then ligated into an appropriate expression vector. The purpose of a 
degenerate set of genes is to provide, in one mixture, all of the sequences encoding the 
desired set of potential TI sequences. The synthesis of degenerate oligonucleotides is 
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well known in the art (see for example, Narang, SA (1983) Tetrahedron 39:3; Itakura et 
al. (1981) Recombinant DNA, Proc 3rd Cleveland Sympos. Macromolecules, ed. AG 
Walton, Amsterdam: Elsevier pp 273-289; Itakura et al. (1984) Annu. Rev. Biochem. 
53:323; Itakura etal. (1984) Science 198:1056; Ikeet al. (1983) Nucleic Acid Res. 
5 1 1 :477. Such techniques have been employed in the directed evolution of other proteins 
(see, for example, Scott et al. (1 990) Science 249:386-390; Roberts et al. ( 1 992) Proc. 
Natl Acad. Sci. USA 89:2429-2433; Devlin et al. (1990) Science 249: 404-406; Cwirla 
et al. (1990) Proc. Natl Acad. Sci. USA 87: 6378-6382; as well as U.S. Patents Nos. 
5,223,409, 5,198,346, and 5,096,815). 

10 Likewise, a library of coding sequence fragments can be provided for a 77 clone 

in order to generate a variegated population of 77 fragments for screening and 
subsequent selection of bioactive fragments. A variety of techniques are known in the 
art for generating such libraries, including chemical synthesis. In one embodiment, a 
library of coding sequence fragments can be generated by (i) treating a double stranded 

1 5 PCR fragment of a 77 coding sequence with a nuclease under conditions wherein nicking 
occurs only about once per molecule; (ii) denaturing the double stranded DNA; (iii) 
renaturing the DNA to form double stranded DNA which can include sense/antisense 
pairs from different nicked products; (iv) removing single stranded portions from 
reformed duplexes by treatment with SI nuclease; and (v) ligating the resulting fragment 

20 library into an expression vector. By this exemplary method, an expression library can 
be derived which codes for N-terminal, C-terminal and internal fragments of various 
sizes. 

A wide range of techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 

25 libraries for gene products having a certain property. Such techniques will be generally 
adaptable for rapid screening of the gene libraries generated by the combinatorial 
mutagenesis of 77 homologues. The most widely used techniques for screening large 
gene libraries typically comprises cloning the gene library into replicable expression 
vectors, transforming appropriate cells with the resulting library of vectors, and 

30 expressing the combinatorial genes under conditions in which detection of a desired 
activity facilitates relatively easy isolation of the vector encoding the gene whose 
product was detected. Each of the illustrative assays described below are amenable to 
high through-put analysis as necessary to screen large numbers of degenerate 77 
sequences created by combinatorial mutagenesis techniques. 

35 In one embodiment, cell based assays can be exploited to analyze the variegated 

77 library. For instance, the library of expression vectors can be transfected into a cell 
line ordinarily responsive to insulin. The transfected cells are then contacted with the 
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insulin and the effect of the 77 mutant on signaling by a Y5 receptor can be detected. 
Plasmid DNA can then be recovered from the cells which score for inhibition, or 
alternatively, potentiation of lipid receptor induction, and the individual clones further 
characterized. 

CombinatoriaJ mutagenesis has a potential to generate very large libraries of 
mutant proteins, e.g., in the order of 1026 molecules. Combinatorial libraries of this size 
may be technically challenging to screen even with high throughput screening assays. 
To overcome this problem, a new technique has been developed recently, recrusive 
ensemble mutagenesis (REM), which allows one to avoid the very high proportion of 
non-functional proteins in a random library and simply enhances the frequency of 
functional proteins, thus decreasing the complexity required to achieve a useful 
sampling of sequence space. REM is an algorithm which enhances the frequency of 
functional mutants in a library when an appropriate selection or screening method is 
employed (Arkin and Yourvan (1992) Proc. Natl. Acad. Sci. USA 89:781 1-7815- 
Yourvan et al. (1992) Parallel Problem Solving from Nature,!., In Maenner and 
Manderick, eds.. Elsevier Publishing Co., Amsterdam, pp. 401-410; Delgrave et al. 
( 1 993) Protein Engineering 6(3):327-33 1 ). 

The invention also provides for reduction of the 77 proteins to generate mimetics, 
e.g. peptide or non-peptide agents, which are able to disrupt binding of a 77 polypeptide 
of the present invention with either upstream or downstream components of a lipid 
uptake signaling cascade, such as binding proteins or interactors. Thus, such mutagenic 
techniques as described above are also useful to map the determinants of the 77 proteins 
which participate in protein-protein interactions involved in, for example, binding of the 
subject 77 polypeptide to proteins which may function upstream (including both 
activators and repressors of its activity) or to proteins or nucleic acids which may 
function downstream of the TI polypeptide, whether they are positively or negatively 
regulated by it, for example. To illustrate, the critical residues of a subject 77 
polypeptide which are involved in molecular recognition of, for example, tub or other 
components upstream or downstream of a 77 can be determined and used to generate TI- 
derived peptidomimetics which competitively inhibit binding of the authentic TI protein 
with that moiety. By employing, for example, scanning mutagenesis to map the amino 
acid residues of each of the subject TI proteins which are involved in binding other 
extracellular proteins, peptidomimetic compounds can be generated which mimic those 
residues of the TI protein which facilitate the interaction. Such mimetics may then be 
35 used to interfere with the normal function of a TI protein. For instance, non- 

hydrolyzable peptide analogs of such residues can be generated using benzodiazepine 
(e.g., see Freidingcr et al. in Peptides: Chemistry and Biology,, G.R. Marshall ed.. 
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ESCOM Publisher: Leiden, Netherlands, 1988), azepine (c.g M see Huffman et al. in 
Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, 
Netherlands, 1988), substituted gamma lactam rings (Garvey et aL in Peptides: 
Chemistry and Biology y G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 
5 1988), keto-methylene pseudopeptides (Ewenson et al. (1986) J. Med. Chem. 29:295; 
and Ewenson et al. in Peptides: Structure and Function (Proceedings of the 9th 
American Peptide Symposium) Pierce Chemical Co. Rockland, IL, 1985), b-turn 
dipeptide cores (Nagai et al. ( 1 985) Tetrahedron Lett 26:647; and Sato et al. ( 1 986) J. 
Chem. Soc. Perkin. Trans. 1:1231), and b-aminoalcohols (Gordon ct al. (1985) Biochem. 
10 Biophys. Res. Commun. 126:419; and Dann et al. (1986) Biochem. Biophys. Res. 
Commun. 134:71). 

Cells Expressing Recombinant TI polypeptides. 

This invention also pertains to host cells transfected to express a recombinant 

15 form of the subject TI polypeptides. The host cell may be any prokaryotic or cukaryotic 
cell. Thus, a nucleotide sequence derived from the cloning of mammalian TI proteins, 
encoding all or a selected portion of the full-length protein, can be used to produce a 
recombinant form of a TI polypeptide via microbial or eukaryotic cellular processes. 
Ligating the polynucleotide sequence into a gene construct, such as an expression 

20 vector, and transforming or transfecting into hosts, either eukaryotic (yeast, avian, insect 
or mammalian) or prokaryotic (bacterial cells), are standard procedures used in 
producing other well-known proteins, e.g. MAP kinase, p53, WT1, PTP phosphatases, 
SRC, and the like. Similar procedures, or modifications thereof, can be employed to 
prepare recombinant 77 polypeptides by microbial means or tissue-culture technology in 

25 accord with the subject invention. 

The recombinant 77 genes can be produced by ligating a nucleic acid encoding a 
TI protein, or a portion thereof, into a vector suitable for expression in either prokaryotic 
cells, eukaryotic cells, or both. Expression vectors for production of recombinant forms 
of the subject TI polypeptides include plasmids and other vectors. For instance, suitable 

30 vectors for the expression of a TI polypeptide include plasmids of the types: pBR322- 
derived plasmids, pEMBL-dcrived plasmids, pEX-derivcd plasmids, pBTac-derived 
plasmids and pUC-derived plasmids for expression in prokaryotic cells, such as E. coli. 

A number of vectors exist for the expression of recombinant proteins in yeast. 
For instance, YEP24, YIPS, YEP51, YEP52, pYES2, and YRP17 are cloning and 

35 expression vehicles useful in the introduction of genetic constructs into S. cerevisiae 
(see, for example. Broach et al. (1983) in Experimental Manipulation of Gene 
Expression, ed. M. Inouye Academic Press, p. 83, incorporated by reference herein). 
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These vectors can replicate in E. coli due the presence of the pBR322 ori and in 9. 
cerevisiae due to the replication determinant of the yeast 2 micron plasmid. In addition 
drug resistance markers such as ampicillin can be used. In an illustrative embodiment a 
77 polypept.de is produced recombinant^ utilizing an expression vector generated by 
5 sub-cloning the coding sequence of one of the 77 genes represented in SEQ ID NOs l 3 
5, 7, or 9. ' ' 

The preferred mammalian expression vectors contain both prokaryotic 
sequences, to facilitate the propagation of the vector in bacteria, and one or more 
eukaryot.c transcription units that are expressed in eukaryotic cells. The pcDNAI/amp 
' pcDNAI/neo, pRc/CMV, P SV2gpt, pSV2neo, P SV2-dhfr, P Tk2, pRSVneo, pMSG ' 
PSVT7, PKO-neo and pHyg derived vectors are examples of mammalian expression 
vectors suitable for transfection of eukaryotic cells. Some of these vectors are modified 
with sequences from bacterial plasmids, such as pBR322, to facilitate replication and 
drug resistance selection in both prokaryotic and eukaryotic cells. Alternatively 
denvatives of viruses such as the bovine papillomavirus (BPV-I), or Epstein-Barr virus 
(pHEBo, pREP-derived and P 205) can be used for transient expression of proteins in 
eukaryotic cells. The various methods employed in the preparation of the plasmids and 
transformation of host organisms are well known in the art. For other suitable 
expression systems for both prokaryotic and eukaryotic cells, as well as general 
recombinant procedures, see Molecular Cloning A Laboratory Manual, 2nd Ed cd by 
Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 1 989) Chapters 
1 6 and 17. 

In some instances, it may be desirable to express the recombinant 77 polypeptide 
by the use of a baculovirus expression system. Examples of such baculovirus expression 
systems include pVL-derived vectors (such as pVL 1 392, pVL 1 393 and pVL94 1 ) 
pAcUW-derived vectors (such as pAcUWl), and pBIueBac-derived vectors (such as the 
B-gal containing pBlueBac III). 

When it is desirable to express only a portion of a Tl protein, such as a form 
lackmg a port.on of the N-terminus, i.e. a truncation mutant which lacks the signal 
peptide, ,t may be necessary to add a start codon (ATG) to the oligonucleotide fragment 
conta.n.ng the desired sequence to be expressed. It is well known in the art that a 
meth.on,ne at the N-terminal position can be enzymatically cleaved by the use of the 
enzyme methionine aminopeptidase (MAP). MAP has been cloned from E coli (Ben- 
Bassatetal. (.987); Bacterial. 1 69:75 1 -757) and Salmonella typhimurium and i,s/„ 
v,tro act.v.ty has been demonstrated on recombinant proteins (Miller et al (1 987) Proc 
Natl. Acad. Sci. 84:271 8-1 722). Therefore, removal of an N-terminal methionine, if 
des.red. can be achieved either /* v/vo by expressing 77-derived polypeptides in a host 
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which produces MAP (e.g., £. coli or CM89 or S. cerevisiae), or in vitro by use of 
purified MAP (e.g., procedure of Miller et al., supra). 

In other embodiments transgenic animals, described in more detail below could 
be used to produce recombinant proteins. 

5 

Fusion Proteins and Immunogens. 

In another embodiment, the coding sequences for the polypeptide can be 
incorporated as a part of a fusion gene including a nucleotide sequence encoding a 
different polypeptide. This type of expression system can be useful under conditions 

10 where it is desirable to produce an immunogenic fragment of a 77 protein. For example, 
the VP6 capsid protein of rotavirus can be used as an immunologic carrier protein for 
portions of the 77 polypeptide, either in the monomelic form or in the form of a viral 
particle. The nucleic acid sequences corresponding to the portion of a subject 77 protein 
to which antibodies are to be raised can be incorporated into a fusion gene construct 

1 5 which includes coding sequences for a late vaccinia virus structural protein to produce a 
set of recombinant viruses expressing fusion proteins comprising 77 epitopes as part of 
the virion. It has been demonstrated with the use of immunogenic fusion proteins 
utilizing the hepatitis b surface antigen fusion proteins that recombinant hepatitis b 
virions can be utilized in this role as well. Similarly, chimeric constructs coding for 

20 fiision proteins containing a portion of a 77 protein and the poliovirus capsid protein can 
be created to enhance immunogenicity of the set of polypeptide antigens (see, for 
example, EP Publication No: 0259149; and Evans et al. (1989) Nature 339:385; Huang 
et al. (1988) J. Virol 62:3855; and Schlienger et al. (1992) J. Virol 66:2). 

The Multiple Antigen Peptide system for peptide-based immunization can also 

25 be utilized to generate an immunogen, wherein a desired portion of a TI polypeptide is 
obtained directly from organo-chemical synthesis of the peptide onto an oligomeric 
branching lysine core (see, for example, Posnett et al. (1988) J. Biol Chem. 263:1719 
and Nardelli et al. (1992) J. Immunol 148:914). Antigenic determinants of TI proteins 
can also be expressed and presented by bacterial cells. 

30 In addition to utilizing fusion proteins to enhance immunogenicity, it is widely 

appreciated that fusion proteins can also facilitate the expression of proteins, and 
accordingly, can be used in the expression of the TI polypeptides of the present 
invention. For example, TI polypeptides can be generated as glutathione-S-transferasc 
(GST- fusion) proteins. Such GST- fusion proteins can enable easy purification of the TI 

35 polypeptide, as for example by the use of glutathione-deri vatized matrices (see, for 
example. Current Protocols in Molecular Biology, eds. Ausubel et al. (John Wiley & 
Sons, NY 1991)). 
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In another embodiment, a fusion gene coding for a purification leader sequence 
such as a poly-(His)/enterokinase cleavage site sequence at the N-terminus of the desired 
portaon of the recombinant protein, can allow purification of the expressed fusion protein 
by affimty chromatography using a Ni2+ metal resin. The purification leader sequence 
> can then be subsequently removed by treatment with enterokinase to provide the purified 

Techniques for making fusion genes are known to those skilled in the art 
Essentially, the joining of various DNA fragments coding for different polypeptide 
sequences is performed in accordance with conventional techniques, employing blunt- 
ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for 
approbate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase 
treatment to avoid undesirable joining, and enzymatic ligation. In another embodiment 
the fus,on gene can be synthesized by conventional techniques including automated 
DNA synthesizers. Alternatively, PGR amplification of gene fragments can be carried 
out using anchor primers which give rise to complementary overhangs between two 
consecut.ve gene fragments which can subsequently be annealed to generate a chimeric 
gene sequence (see, for example, Current Protocols in Molecular Biology, eds Ausubel 
et al. John Wiley & Sons: 1992). 

Antibodies 

Another aspect of the invention pertains to an antibody specifically reactive with 
a mammalian Tl protein. For example, by using immunogens derived from a 7Y protein 
eg. based on the cDNA sequences, anti-prote.n/anti-peptide antisera or monoclonal ' 
anybodies can be made by standard protocols (See, for example, Antibodies A 
Laboratory Manual ed. by Harlow and Lane (Cold Spring Harbor Press 1988)) A 
mammal, such as a mouse, a hamster or rabbit can be immunized with an immunogenic 
form of the peptide (e.g., a mammalian 77 polypeptide or an antigenic fragment which is 
capable of eliciting an antibody response, or a fusion protein as described above) 
Techmques for conferring immunogenicity on a protein or peptide include conjugation 
to earners or other techniques well known in the art. An immunogenic portion of a Tl 
protein can be administered in the presence of adjuvant. The progress of immunization 
can be monitored by detection of antibody titers in plasma or serum. Standard ELISA or 
other immunoassays can be used with the immunogen as antigen to assess the levels of 
ant.bod.es. In a preferred embodiment, the subject antibodies are monospecific for 
anfgenic determinants of a 77 protein of a mammal, e.g. antigenic determinants of a 
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protein encoded by SEQ ID NOs:l-6 or closely related homologies (e.g. at least 90% 
homologous, and more preferably at least 94% homologous). 

Following immunization of an animal with an antigenic preparation of a 77 
polypeptide, anti-77 antisera can be obtained and, if desired, polyclonal anti-77 
5 antibodies isolated from the serum. To produce monoclonal antibodies, antibody- 
producing cells (lymphocytes) can be harvested from an immunized animal and fused 
by standard somatic cell fusion procedures with immortalizing cells such as myeloma 
cells to yield hybridoma cells. Such techniques are well known in the art, and include, 
for example, the hybridoma technique (originally developed by Kohler and Milstein, 
10 (1 975) Nature 256: 495-497), the human B cell hybridoma technique (Kozbar et al. 

(1983) Immunology Today 4:72), and the EBV-hybridoma technique to produce human 
monoclonal antibodies (Cole et al., (1985) Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, Inc. pp. 77-96). Hybridoma cells can be screened 
immunochemically for production of antibodies specifically reactive with a 
1 5 mammalian 77 polypeptide of the present invention and monoclonal antibodies isolated 
from a culture comprising such hybridoma cells. In one embodiment anti-human 77 
antibodies specifically react with any of the proteins encoded by the DNA of ATCC 
deposit Nos. 98125- 98128. 

The term "antibody" as used herein is intended to include fragments thereof 
20 which are also specifically reactive with one of the subject mammalian 77 polypeptides. 
Antibodies can be fragmented using conventional techniques and the fragments screened 
for utility in the same manner as described above for whole antibodies. For example, 
F(ab)2 fragments can be generated by treating antibody with pepsin. The resulting 
F(ab)2 fragment can be treated to reduce disulfide bridges to produce Fab fragments. 
25 The antibody of the present invention is further intended to include bispecific, single- 
chain and chimeric molecules having affinity for a 77 protein conferred by at least one 
CDR region of the antibody. In preferred embodiments, the antibody further comprises 
a label attached thereto and able to be detected, (e.g. the label can be a radioisotope, 
fluorescent compound, enzyme or enzyme co-factor). 
30 Antibodies which specifically bind 77 epitopes can also be used in 

immunohistochemical staining of tissue samples in order to evaluate the abundance and 
pattern of expression of each of the subject Tl polypeptides. Anti-77 antibodies can be 
used diagnostically in immuno-precipitation and immuno-blotting to detect and evaluate 
Tl protein levels in tissue as part of a clinical testing procedure. For instance, such 
35 measurements can be useful in predictive valuations of the onset or progression of 

proliferative disorders. Likewise, the ability to monitor 7/ protein levels in an individual 
can allow determination of the efficacy of a given treatment regimen for an individual 
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afflicted with such a disorder. The leve. of 77 polypeptides may be measured from cells 
m bod.ly fluid, such as in samples of cerebral spinal fluid, such as produced by biopsy 
D,agnost,c assays using ami- 7V antibodies can include, for example, immunoassays 
designed to aid in early diagnosis of a degenerative disorder. Diagnostic assays using 
5 anti- TV polypeptide antibodies can also include immunoassays designed to aid in early 
diagnosis and phenotyping neoplasic or hyperplastic disorders. 

Another application of anti-TV antibodies of the present invention is in the 
.mmunologica, screening of cDNA libraries constructed in expression vectors such as X 
gtl 1, *gt,8-23, XZAP, and A.ORF8. Messenger libraries of this type, having coding 
sequences inserted in the correct reading frame and orientation, can produce fusion 
protem. For instance, Xgt, , wi„ produce fusion proteins whose amino termini consist 
of B-ga.actos.dase ammo acid sequences and whose carboxy termini consist of a foreign 
po.ypept.de. Antigenic epitopes of a TVprotein, e.g. other orthologues of a particular 77 
pro e, n or other paralogues from the same species, can then be detected with antibodies 
as, for example, reacting nitrocellulose filters lifted from infected plates with anti-77 ' 
anttbod.es. Positive phage detected by this assay can then be isolated from the infected 
Plate. Thus, the presence of TI homologues can be detected and cloned from other 
ammals, as can alternate isoforms (including splicing variants) from humans. 

20 Methods of Treating rw a «> 

There are a wide variety of disorders for which T7 molecules of the present 
■nvention can be used in treatment. As discussed herein 77 molecule can increase the 
transcnpt.on or activity of 77 molecules in a cell. A 77 molecule therapeutic can be as 
appropnate, any of the preparations described above, including isolated polypeptides 
gene therapy constructs, antisense molecules, peptidomimetics or agents identifled in'the 
drug assays provided herein. 

In preferred embodiments the subject 77 molecules are modulated to control 
weight m a subject. Hypothalamic neuropeptide Y (NPY) is a member of the pancreatic 
poly^pt,defami.y and is a potent feeding signal. NPY levels in the paraventricular 
nucleus (PVN) of the brain have been show, to increase with food deprivation and 
return to normal after insulin injections (Sahu et al. ( 1 995) Endocrinology 136 57 1 8) In 
one embodiment the subject T7 molecu.es are modulated to contro. weight in a subject 
by modular™ of a biochemicaJ pathway involving NPY. NPY is thought to signal via 
the Y5 receptor (Gera.d et a.. (1996) Nature 382:168). The distribution of Y5 mRNA 
shows that the Y5 receptor is also involved in regulating the emotional aspect of 
appet,t,ve behavior, ,„ another embodiment the subject 77molecu.es are modulated to 
control wc.ght by modulation of a biochemical pathway involving the Y5 receptor 
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Insulin regulates food intake by altering NPY expression in the hypothalamus of 
the brain (Schwartz et al. (1992) Endocr Rev. 13:387). Insulin deficiency, which can be 
caused, for example, by diabetes, is thought to lead to increased NPY expression in the 
hypothalamus and to the hyperphagia characteristic of uncontrolled type I diabetes 

5 (Sipols et al. (1995) Diabetes 44:147). In one embodiment the subject 77 molecules are 
modulated to control weight in a subject by modulation of a biochemical pathway 
involving insulin. In another embodiment, obesity is controlled by modulation of a 
biochemical pathway involving insulin-like growth factor II (IGF-II). 

In other embodiments, the subject 77 molecules are modulated to affect a 

10 bioactivity of tub in order to effect a treatment for weight control. In a preferred 
embodiment the subject 77 molecules are modulated to control obesity, diabetes, or 
cachexia. 

In still other embodiments, the subject 77 molecules are modulated to control 
apoptosis in a cell. Apoptosis, or programmed cell death, is characterized by distinct 

1 5 morphological changes and can be triggered by a variety of mechanisms. Certain 
apoptosis-inducing agents stimulate sphingomyelinases, which act on sphingolipids 
resulting in the generation of phosphocholine and ceramide, a key regulator of ceil cycle 
control and apoptosis (Pushkareva et al. (1995) Immunology Today 16:295). Ceramide 
is thought to act as a second messenger since a soluble analog of ceramide mimics the 

20 affects of agents that induce ceramide production (Law and Rossie ( 1 995) J. Biol Chem. 
270:12808). Ceramide is thought to control apoptosis via its interaction with the protein 
phosphatase 2A (PP2A) family of serine/threonine protein phosphatases (Hannun (1994) 
J, Biol Chem. 269:3125). The catalytic subunit of PP-2A has been shown to be 
activated by Ceramide (Law and Rossie, supra), 

25 In a preferred embodiment the subject TI molecules are modulated to control 

apoptosis in a cell of the PVN of the brain. In one embodiment modulation of the 
molecules to control apoptosis in the PVN of the brain leads to one or more of weight 
control and diabetes in a subject. 

In one embodiment apoptosis is modulated by modulating the activity of TI-1 in 

30 a cell. In yet another embodiment apoptosis is modulated by modulating the activity of 
TI-2 in a cell. In still another embodiment apoptosis is modulated by modulating the 
activity of TI-3 in a cell. In another embodiment apoptosis is modulated by modulating 
TI-4 activity in a cell. In addition, therapy may involve modulation of any combination 
of the disclosed 77 molecules. 

35 The present invention will also be useful in treating neurodegenerative diseases 

which are characterized by apoptosis, including Alzheimer ! s disease. Parkinson's 
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disease, Huntington's chorea, amyotrophic lateral sclerosis and the like, as well as 
spinocerebellar degenerations. 

In another embodiment the present invention can be used to modulate a pathway 
involving integrin-mediated signaling. 

In another embodiment the subject 77 molecules are modulated to control cell 
cycle progression. Entry of cells into mitosis characteristically involves coordinated 
and simultaneous events, which include, for example, cytoskeletal rearrangements 
disassembly of the nuclear envelope and of the nucleoli, and condensation of chromatin 
into chromosomes. Cell-cycle events are thought to be regulated by a series of 
interdependent biochemical steps, with the initiation of late events requiring the 
successful completion of those proceeding them. In eukaryotic cells mitosis does not 
normally take place until the Gl, S and G2 phases of the cell-cycle are completed For 
instance, at least two stages in the cell cycle are regulated in response to DNA damage 
*e Gl/S and the G2/M transitions. These transitions serve as checkpoints to which ceils 
delay cell-cycle progress to allow repair of damage before entering either S phase, when 
damage would be perpetuated, or M phase, when breaks would result in loss of genomic 
material. Both the Gl/S and G2/M checkpoints are known to be under genetic control as 
there are mutants that abolish arrest or delay which ordinarily occur in wild-type cells in 
response to DNA damage. 

Tumor suppressors have also been linked to cell cycle control. For example 
both P 53 (Green (1989) Cell 56:1-3; Mowat et al (1985)^, 314:633-636) and the 
retinoblastoma gene produce (Rb) have been linked to cell cycle control. The first firm 
ev.dence for a specific biochemical link between P 53 and the cell cycle comes a finding 
that P 53 apparently regulates expression of a second protein, P 21, which inhibits cyclin- 
dependent kinases (cdks) needed to drive cells through the cell cycle, e.g. from Gl into 
S phase (Xiong et al. (1993) Nature 366:701-704). C6 ceramide has been shown to 
cause dephosphorylation of Rb and Rb deficient cells are more resistant to ceramide- 
induced growth suppression (Pushkareva et al. supra). 

In one embodiment cell cycle progression is modulated by modulating the 
activity of TI-I in a cell. In yet another embodiment ceil cycle progression is modulated 
by modulating the activity of TI-2 in a cell. In still another embodiment cell cycle 
progress,on ,s modulated by modulating the activity of TI-3 in a cell. In another 
embodiment cell cycle progression is modulated by modulating TI-4 in a cell In 
add.uon, therapy may involve modulation of any combination of the disclosed 77 
35 molecules. 

Since, in some cases, genes may be upregulated in a disease state and in other 
cases may be suppressed, it will be desirable to activate and/or potentiate or suppress 
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and/or downmodulate 77 bioactivity depending on the condition to be treated using the 
techniques compounds and methods described herein. 

Among the approaches which may be used to ameliorate disease symptoms 
involving the misexpression of a 77 gene are, for example, antisense, ribozyme, and 

5 triple helix molecules described above. Compounds that compete with an 77 protein for 
binding to upstream or downstream elements in a lipid uptake signaling cascade will 
antagonize a 77 protein, thereby inducing a therapeutic effect. Examples of suitable 
compounds include the antagonists or homologues described in detail above. In other 
instances, the increased expression or bioactivity of a 77 protein may be desirable and 

10 may be accomplished by, for example the use of the 77 agonists or mimetics or by gene 
replacement therapy, as described herein. 

Compounds identified as increasing or decreasing 77 gene expression or protein 
activity can be administered to a subject at therapeutically effective dose to treat the 
diseases described herein. A therapeutically effective dose refers to that amount of the 

15 compound sufficient to effect a change in a 77-associated disorder, such as abnonr .! 
weight control and/or diabetes. 

Effective Dose 

Toxicity and therapeutic efficacy of such compounds can be determined by 

20 standard pharmaceutical procedures in cell cultures or experimental animals, e.g. , for 
determining The LD 50 (the dose lethal to 50% of the population) and the ED 50 (the dose 
therapeutically effective in 50% of the population). The dose ratio between toxic and 
therapeutic effects is the therapeutic index and it can be expressed as the ratio 
LD50/ED50. Compounds which exhibit large therapeutic indices are preferred. While 

25 compounds that exhibit toxic side effects may be used, care should be taken to design a 
delivery system that targets such compounds to the site of affected tissue in order to 
minimize potential damage to uninfected cells and, thereby, reduce side effects. 

The data obtained from the cell culture assays and animal studies can be used in 
formulating a range of dosage for use in humans. The dosage of such compounds lies 

30 preferably within a range of circulating concentrations that include the ED 50 with little 
or no toxicity. The dosage may vary within this range depending upon the dosage form 
employed and the route of administration utilized. For any compound used in the 
method of the invention, the therapeutically effective dose can be estimated initially 
from cell culture assays. A dose may be formulated in animal models to achieve a 

35 circulating plasma concentration range that includes the IC 50 (Le. 9 the concentration of 
the test compound which achieves a half-maximal inhibition of symptoms) as 
determined in cell culture. Such information can be used to more accurately determine 
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useful doses in humans. Levels in plasma may be measured, for example, by high 
performance liquid chromatography. 



Formulation and I Up 

5 Pharmaceutical compositions for use in accordance with the present invention 

may be formulated in conventional manner using one or more physiologically acceptable 
earners or excipients. Thus, the compounds and their physiologically acceptable salts 
and solvates may be formulated for administration by, for example, injection, inhalation 
or insufflation (either through the mouth or the nose) or oral, buccal, parenteral or rectal 
10 administration. 

For such therapy, the oligomers of the invention can be formulated for a variety 
of loads of administration, including systemic and topical or localized administration 
Techniques and formulations generally may be found in Remminffan's Pharmaceutical 
Sciences, Meade Publishing Co., Easton, PA. For systemic administration, injection is 
15 preferred, mcludmg intramuscular, intravenous, intraperitoneal, and subcutaneous For 
injection, the oligomers of the invention can be formulated in liquid solutions, preferably 
in physiologically compatible buffers such as Hank's solution or Ringer's solution In 
addition, the oligomers may be formulated in solid form and redissolved or suspended 
immediately prior to use. Lyophilized forms are also included. 
10 For oral administration, the pharmaceutical compositions may take the form of, 

for example, tablets or capsules prepared by conventional means with pharmaceutical^ 
acceptable excipients such as binding agents (e.g., pregelatinized maize starch, 
polyvinylpyrrolidone or hydroxypropyl methylcellulose); fillers (e g lactose ' 
microcrystalline cellulose or calcium hydrogen phosphate); lubricants (e.g. magnesium 
5 stearate, talc or silica); disintegrants (e.g., potato starch or sodium starch glycolate)- or 
wetting agents (e.g., sodium lauryl sulphate). The tablets may be coated by methods 
well known ,n the art. Liquid preparations for oral administration may take the form of, 
for example, solutions, syrups or suspensions, or they may be presented as a dry produci 
for constitution with water or other suitable vehicle before use. Such liquid preparations 
may prepared bv c °nventional means with pharmaceutical^ acceptable additives 
such as suspending agents (e.g., sorbitol syrup, cellulose derivatives or hydrogenated 
edible fats); emulsifying agents (e.g., lecithin or acacia); non-aqueous vehicles (e g. 
ationd oil, oily esters, ethyl alcohol or fractionated vegetable oils); and preservatives 
(e.g., methyl or propyl-p-hydroxybenzoates or sorbic acid). The preparations may also 
• contain buffer salts, flavoring, coloring and sweetening agents as appropriate. 

Preparations for oral administration may be suitably formulated to give 
controlled release of the active compound. 
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For buccal administration the compositions may take the form of tablets or 
lozenges formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present 
invention are conveniently delivered in the form of an aerosol spray presentation from 

5 pressurized packs or a nebuliser, with the use of a suitable propellant, e.g., 

dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon 
dioxide or other suitable gas. In the case of a pressurized aerosol the dosage unit may be 
determined by providing a valve to deliver a metered amount. Capsules and cartridges 
of e .g. gelatin for use in an inhaler or insufflator may be formulated containing a powder 

10 mix of the compound and a suitable powder base such as lactose or starch. 

The compounds may be formulated for parenteral administration by injection, 
e.g., by bolus injection or continuous infusion. Formulations for injection may be 
presented in unit dosage form, e.g.. in ampoules or in multi-dose containers, with an 
added preservative. The compositions may take such forms as suspensions, solutions or 

1 5 emulsions in oily or aqueous vehicles, and may contain formulatory agents such as 
suspending, stabilizing and/or dispersing agents. Alternatively, the active ingredient 
may be in powder form for constitution with a suitable vehicle, e.g., sterile pyrogen-frcc 
water, before use. 

The compounds may also be formulated in rectal compositions such as 
20 suppositories or retention enemas, e.g., containing conventional suppository bases such 
as cocoa butter or other glycerides. 

In addition to the formulations described previously, the compounds may also be 
formulated as a depot preparation. Such long acting formulations may be administered 
by implantation (for example subcutaneously or intramuscularly) or by intramuscular 
25 injection. Thus, for example, the compounds may be formulated with suitable 

polymeric or hydrophobic materials (for example as an emulsion in an acceptable oil) or 
ion exchange resins, or as sparingly soluble derivatives, for example, as a sparingly 
soluble salt. 

Systemic administration can also be by transmucosal or transdermal means. For 
30 transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants arc generally known in the art, 
and include, for example, for transmucosal administration bile salts and fusidic acid 
derivatives, in addition, detergents may be used to facilitate permeation. Transmucosal 
administration may be through nasal sprays or using suppositories. For topical 
35 administration, the oligomers of the invention are formulated into ointments, salves, 
gels, or creams as generally known in the art. 
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In clinical settings, the gene delivery systems for the therapeutic 77 gene can be 
introduced into a patient by any of a number of methods, each of which is familiar in the 
art. For instance, a pharmaceutical preparation of the gene delivery system can be 
introduced systemically, e.g. by intravenous injection, and specific transduction of the 
protein in the target cells occurs predominantly from specificity of transfection provided 
by the gene delivery vehicle, cell-type or tissue-type expression due to the 
transcriptional regulatory sequences controlling expression of the receptor gene, or a 
combination thereof. In other embodiments, initial delivery of the recombinant gene is 
more limited with introduction into the animal being quite localized. For example, the 
gene delivery vehicle can be introduced by catheter (see U.S. Patent 5,328,470) or by 
stereotactic injection (e.g. Chen et al. (1994) Proc. Natl. Acad. Sci. USA 91: 3054-3057). 
A 77 gene, such as any one of the sequences represented in the group consisting of SEQ 
ID NOs:l-6 or a sequence homologous thereto can be delivered in a gene therapy 
construct by electroporation using techniques described, for example, by Dev et al. 
(( 1 994) Cancer Treat. Re v. 20: 1 05- 1 1 5). 

The pharmaceutical preparation of the gene therapy construct can consist 
essentially of the gene delivery system in an acceptable diluent, or can comprise a slow 
release matrix in which the gene delivery vehicle is imbedded. Alternatively, where the 
complete gene delivery system can be produced intact from recombinant cells, e.g. 
retroviral vectors, the pharmaceutical preparation can comprise one or more cells which 
produce the gene delivery system. 

The compositions may, if desired, be presented in a pack or dispenser device 
which may contain one or more unit dosage forms containing the active ingredient. The 
pack may for example comprise metal or plastic foil, such as a blister pack. The pack or 
25 dispenser device may be accompanied by instructions for administration. 

Diagnostic and Prognostic Assays 

In the diagnostic and prognostic assays described herein, in addition to the 77 
nucleic acid molecules and polypeptides described above, the present invention provides 
for the use of a nucleic acid comprising at least a portion of a 77 nucleic acid molecule, 
for example, at least a portion of a nucleic acid sequence shown in SEQ ID NOs: 1 - 6 or 
polypeptides encoded by at least a portion of the nucleic acid sequence shown in SEQ 



20 



30 



35 



The present method provides a method for determining if a subject is at risk for a 
disorder characterized by apoptosis or aberrant cell proliferation. In preferred 
embodiments, the methods can be characterized as comprising detecting, in a sample of 
cells from the subject, the presence or absence of a genetic lesion characterized by at 
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least one of (i) an alteration affecting the integrity of a gene encoding a 77-protein. or (ii) 
the mis-expression of the 77 gene. To illustrate, such genetic lesions can be detected by 
ascertaining the existence of at least one of (i) a deletion of one or more nucleotides 
from a 77 gene, (ii) an addition of one or more nucleotides to a 77 gene, (iii) a 
5 substitution of one or more nucleotides of a 77 gene, (iv) a gross chromosomal 
rearrangement of a 77 gene, (v) a gross alteration in the level of a messenger RNA 
transcript of a 77 gene, (vii) aberrant modification of a 77 gene, such as of the 
methylation pattern of the genomic DNA, (vii) the presence of a non-wild type splicing 
pattern of a messenger RNA transcript of a 77 gene, (viii) a non-wild type level of a 77- 

1 0 protein, (ix) allelic loss of a 77 gene, and (x) inappropriate post-translational 

modification of a 77-protein. As set out below, the present invention provides a large 
number of assay techniques for detecting lesions in a 77 gene, and importantly, provides 
the ability to discern between different molecular causes underlying 77-dependent 
aberrant cell growth, proliferation and/or differentiation. 

15 In an exemplary embodiment, there is provided a nucleic acid composition 

comprising a (purified) oligonucleotide probe including a region of nucleotide sequence 
which is capable of hybridizing to a sense or antisense sequence of a 77 gene, such as 
represented by any of SEQ ID NOs:l -6, or naturally occurring mutants thereof, or 5 1 or 
3* flanking sequences or intronic sequences naturally associated with the subject TI 

20 genes or naturally occurring mutants thereof. The nucleic acid of a cell is rendered 

accessible for hybridization, the probe is exposed to nucleic acid of the sample, and the 
hybridization of the probe to the sample nucleic acid is detected. Such techniques can 
be used to detect lesions at either the genomic or mRNA level, including deletions, 
substitutions, etc., as well as to determine mRNA transcript levels. 

25 As set out above, one aspect of the present invention relates to diagnostic assays 

for determining, in the context of cells isolated from a patient, if mutations have arisen 
in one or more Tl of the sample cells. The present method provides a method for 
determining if a subject is at risk for a disorder characterized by aberrant cell 
proliferation and/or differentiation. In preferred embodiments, the method can be 

30 generally characterized as comprising detecting, in a sample of cells from the sub ject, 
the presence or absence of a genetic lesion characterized by an alteration affecting the 
integrity of a gene encoding a 77. To illustrate, such genetic lesions can be detected by 
ascertaining the existence of at least one of (i) a deletion of one or more nucleotides 
from a 77-gene, (ii) an addition of one or more nucleotides to a 77-gcne, (iii) a 

35 substitution of one or more nucleotides of a 77-gene, and (iv) the presence of a non-wild 
type splicing pattern of a messenger RNA transcript of a 77-gene. As set out below, the 
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present invention provides a large number of assay techniques for detecting lesions in 77 
genes. 

In certain embodiments, detection of the lesion comprises utilizing the 
probe/primer in a polymerase chain reaction (PGR) (see, e.g. U.S. Patent Nos 4 683 195 
5 and 4,683,202), such as anchor PGR or RACE PGR, or, alternatively, in a ligation chain 
reaction (LCR)(see, e.g., Landegran etal. (1988) Science 241:1077-1080- and 
Nakazawa et a.. (1994) PNAS91 :360-364), the latter of which can be particularly useful 
for detecting point mutations in the 7V-gene (see Abravaya et al. ( 1 995) Nuc Acid Res 
23:675-682). In a merely illustrative embodiment, the method includes the steps of (i) 

n^T 8 h Tf ° f r' ,S fr ° m 3 P3tient ' 00 iSOlatin8 nUC,dC add S-mic, 
mRNA or both) from the cells of the sample, (iii) contacting the nucleic acid sample 

w.th one or more primers which specifically hybridize to a 77 gene under conditions 

such that hybridization and amplification of the TY-gene (if present) occurs and (iv) 

detecnng the presence or absence of an amplification product, or detecung the size of the 

am P hficat,on product and comparing the length to a control sample. „ is anticipated 

that PGR and/or LCR may be desirable to use as a preliminary amplification step in 

con.unct.on with any of the techniques used for detecting mutations described herein 

Another embodiment of the invention provides for a nucleic acid composition 
compn Slng a (purified) oligonudeotide prof)e & ^ of 

wh.ch .s capable of hybridizing to a sense or antisense sequence of a 77- g ene or 
naturally occurring mutants thereof, or 5' or 3' flanking sequences or intronic sequences 
naturally associated with the subject 77-genes or naturally occurring mutants thereof 
THe nuc.e.c acid of a cell is rendered accessible for hybridization, the probe is exposed 
to nucleic acid of the sample, and the hybridization of the probe to the sample nucleic 
«jd » detected. Such techniques can be used to detect lesions at either the genomic or 
mRNA level, .nclud.ng deletions, substitutions, etc., as well as to determine mRNA 
transcr.pt levels. Such oligonucleotide probes can be used for both predictive and 
therapeutic evaluation of allelic mutations which might be manifest in, for example 
apoptosis or aberrant cell growth. ' 

The methods described herein may be performed, for example, by utilizing pre- 
packaged diagnostic kits comprising at least one probe nucleic acid or antibody reagent 
desenbed here.n, which may be conveniently used, , g ., ,„ cIinicai set(ings to ^ 
patterns exh.biting symptoms or family history of a disease or illness involving a 77 
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Ant.bod.es d.rected against wild type or mutant 77 proteins, which are discussed 
above mayalsobeusedindiseasediagnosticsandprognostic, Such diagnostic ' ' 
methods, may be used to detect abnormalities in the level of 77 protein expression or 
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abnormalities in the structure and/or tissue, cellular, or subcellular location of 77 protein. 
Structural differences may include, for example, differences in the size, 
electronegativity, or antigenicity of the mutant 77 protein relative to the normal 77 
protein. Protein from the tissue or cell type to be analyzed may easily be detected or 
5 isolated using techniques which are well known to one of skill in the art, including but 
not limited to western blot analysis. For a detailed explanation of methods for carrying 
out western blot analysis, see Sambrook et al, 1989, supra, at Chapter 18. The protein 
detection and isolation methods employed herein may also be such as those described in 
Harlow and Lane, for example, (Harlow, E. and Lane, D., 1988, "Antibodies: A 
10 Laboratory Manual", Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York), which is incorporated herein by reference in its entirety. 

This can be accomplished, for example, by immunofluorescence techniques 
employing a fluorescently labeled antibody (see below) coupled with light microscopic, 
flow cytometric, or fluorimetric detection. The antibodies (or fragments thereof) useful 
15 in the present invention may, additionally, be employed histologically, as in 

immunofluorescence or immunoelectron microscopy, for in situ detection of 77 proteins. 
In siiu detection may be accomplished by removing a histological specimen from a 
patient, and applying thereto a labeled antibody of the present invention. The antibody 
(or fragment) is preferably applied by overlaying the labeled antibody (or fragment) onto 
20 a biological sample. Through the use of such a procedure, it is possible to determine not 
only the presence of the 77 protein, but also its distribution in the examined tissue. 
Using the present invention, one of ordinary skill will readily perceive that any of a wide 
variety of histological methods (such as staining procedures) can be modified in order to 
achieve such in situ detection. 
25 Often a solid phase support or carrier is used as a support capable of binding an 

antigen or an antibody. Well-known supports or carriers include glass, polystyrene, 
polypropylene, polyethylene, dextran, nylon, amylases, natural and modified celluloses, 
polyacrylamides, gabbros, and magnetite. The nature of the carrier can be either soluble 
to some extent or insoluble for the purposes of the present invention. The support 
30 material may have virtually any possible structural configuration so long as the coupled 
molecule is capable of binding to an antigen or antibody. Thus, the support 
configuration may be spherical, as in a bead, or cylindrical, as in the inside surface of a 
test tube, or the external surface of a rod. Alternatively, the surface may be flat such as a 
sheet, test strip, etc. Preferred supports include polystyrene beads. Those skilled in the 
35 art will know many other suitable carriers for binding antibody or antigen, or will be 
able to ascertain the same by use of routine experimentation. 
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One means for labeling an anti-77 protein specific antibody is via linkage to an 
enzyme and use in an enzyme immunoassay (EIA) (Voller, "The Enzyme Linked 
Immunosorbent Assay (ELISA)", Diagnostic Horizons 2:1-7, 1978, Microbiological 
Associates Quarterly Publication, Walkersville, MD; Voller, et al., J. Clin. Pathol. 
31:507-520 (1978); Butler, Meth. Enzymol. 73:482-523 (1981); Maggio, (ed.) Enzyme 
Immunoassay, CRC Press, Boca Raton, FL, 1980; Ishikawa, et al., (eds.) Enzyme 
Immunoassay, Kgaku Shoin, Tokyo, 1981). The enzyme which is bound to the antibody 
will react with an appropriate substrate, preferably a chromogenic substrate, in such a 
manner as to produce a chemical moiety which can be detected, for example, by 
spectrophotometric, fluorimetric or by visual means. Enzymes which can be used to 
detectably label the antibody include, but are not limited to, malate dehydrogenase, 
staphylococcal nuclease, delta-5-steroid isomerase, yeast alcohol dehydrogenase, alpha- 
glycerophosphate, dehydrogenase, triose phosphate isomerase, horseradish peroxidase, 
alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, 
1 5 urease, catalase, glucosc-6-phosphate dehydrogenase, glucoamylase and 

acetylcholinesterase. The detection can be accomplished by colorimetric methods which 
employ a chromogenic substrate for the enzyme. Detection may also be accomplished 
by visual comparison of the extent of enzymatic reaction of a substrate in comparison 
with similarly prepared standards. 
20 Detection may also be accomplished using any of a variety of other 

immunoassays. For example, by radioactively labeling the antibodies or antibody 
fragments, it is possible to detect fingerprint gene wild type or mutant peptides through 
the use of a radioimmunoassay (RIA) (see, for example. Weintraub, B., Principles of 
Radioimmunoassays, Seventh Training Course on Radioligand Assay Techniques, The 
25 Endocrine Society, March, 1 986, which is incorporated by reference herein). The 

radioactive isotope can be detected by such means as the use of a gamma counter or a 
scintillation counter or by autoradiography. 

It is also possible to label the antibody with a fluorescent compound. When the 
fluorescently labeled antibody is exposed to light of the proper wave length, its presence 
can then be detected due to fluorescence. Among the most commonly used fluorescent 
labeling compounds are fluorescein isothiocyanate, rhodamine, phycoerythrin. 
phycocyanin, allophycocyanin, o-phthaldehyde and fluorcscamine. 

The antibody can also be detectably labeled using fluorescence emitting metals 
such as > Eu, or others of the lanthanide series. These metals can be attached to the 
antibody using such metal chelating groups as diethylenetriaminepentacctic acid 
(DTPA) or ethylenediaminetetraacetic acid (EDTA). 
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The antibody also can be detectably labeled by coupling it to a chemiluminescent 
compound. The presence of the chemiluminescent-tagged antibody is then determined 
by detecting the presence of luminescence that arises during the course of a chemical 
reaction. Examples of particularly useful chemiluminescent labeling compounds are 
5 luminol, isoluminol, theromatic acridinium ester, imidazole, acridinium salt and oxalate 
ester. 

Likewise, a bioluminescent compound may be used to label the antibody of the 
present invention. Bioluminescence is a type of chemiluminescence found in biological 
systems in, which a catalytic protein increases the efficiency of the chemiluminescent 
10 reaction. The presence of a bioluminescent protein is determined by detecting the 
presence of luminescence. Important bioluminescent compounds for purposes of 
labeling are luciferin, luciferasc and aequorin. 

Moreover, it will be understood that any of the above methods for detecting 
alterations in a 77 gene or gene product can be used to monitor the course of treatment or 
1 5 therapy. 

Drup Screening Assays 

In drug screening assays described herein, in addition to the 77 nucleic acid 
molecules and polypeptides described above, the present invention also provides for the 
20 use of nucleic acid molecules comprising at least a portion of a 77 nucleic acid molecule, 
for example, at least a portion of a sequence shown in SEQ ID NOs:l-6 or polypeptides 
encoded by at least a portion of the nucleic acid sequence shown in any of SEQ ID 
NOs:l-6. 

Furthermore, by making available purified and recombinant 77 polypeptides, the 
25 present invention facilitates the development of assays which can be used to screen for 
drugs, including homologues, which are either agonists or antagonists of the normal 
cellular function of the subject polypeptides. In one embodiment, the assay evaluates 
the ability of a compound to modulate binding between a 77 polypeptide and a molecule, 
be it protein or DNA, that interacts either upstream or downstream of the 77 polypeptide 
30 in a lipid transfer pathway. A variety of assay formats will suffice and, in light of the 
present inventions, will be comprehended by a skilled artisan. 

Cell -Free Assays 

In many drug screening programs which test libraries of compounds and natural 
35 extracts, high throughput assays are desirable in order to maximize the number of 

compounds surveyed in a given period of time. Assays which are performed in cell-free 
systems, such as may be derived with purified or semi-purified proteins, are often 
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preferred as "primary" screens in that they can be generated to permit rapid development 
and relatively easy detection of an alteration in a molecular target which is mediated by 
a test compound. Moreover, the effects of cellular toxicity and/or bioavailability of the 
test compound can be generally ignored in the in vitro system, the assay instead being 
focused primarily on the effect of the drug on the molecular target as may be manifest in 
an alteration of binding affinity with upstream or downstream elements. Accordingly in 
an exemplary screening assay of the present invention, the compound of interest is 
contacted with proteins which may function upstream (including both activators and 
repressors of its activity) or to proteins or nucleic acids which may function downstream 
of the 77 polypeptide, whether they are positively or negatively regulated by it. To the 
mixture of the compound and the upstream or downstream element is then added a 
composition containing a 77 polypeptide. Detection and quantification of complexes of 
77 with it's upstream or downstream elements provide a means for determining a 
compound's efficacy at inhibiting (or potentiating) complex formation between 77 and 
the 7/-binding elements. The efficacy of the compound can be assessed by generating 
dose response curves from data obtained using various concentrations of the test 
compound. Moreover, a control assay can also be performed to provide a baseline for 
comparison. In the control assay, isolated and purified 77 polypeptide is added to a 
composition containing the 77-binding element, and the formation of a complex is 
20 quantitated in the absence of the test compound. 

Complex formation between the 77 polypeptide and a binding element (e g Tub) 
may be detected by a variety of techniques. Modulation of the formation of complexes 
can be quantitated using, for example, detectably labeled proteins such as radiolabeled 
fluorescently labeled, or enzymatically labeled 77 polypeptides, by immunoassay, or by 
25 chromatographic detection. 

Typically, it will be desirable to immobilize either 77 or its binding protein to 
facilitate separation of complexes from uncomplexed forms of one or both of the 
proteins, as well as to accommodate automation of the assay. Binding of TI to an 
upstream or downstream element, in the presence and absence of a candidate agent can 
be accomplished in any vessel suitable for containing the reactants. Examples include 
m.crotitre plates, test tubes, and micro-centrifuge tubes. In one embodiment a fusion 
protem can be provided which adds a domain that allows the protein to be bound to a 
matnx. For example, glutathione-S-transferase/77 (GST/77) fusion proteins can be 
adsorbed onto glutathione scpharose beads (Sigma Chemical, St. Louis, MO) or 
glutathione derivatized microtitre plates, which are then combined with the cell lysates 
e.g. an 35 S -|abeled, and the test compound, and the mixture incubated under conditions' 
conducve to complex formation, e.g. at physiological conditions for salt and P H. though 
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slightly more stringent conditions may be desired. Following incubation, the beads are 
washed to remove any unbound label, and the matrix immobilized and radiolabel 
determined directly (e.g. beads placed in scintillant), or in the supernatant after the 
complexes are subsequently dissociated. Alternatively, the complexes can be 
5 dissociated from the matrix, separated by SDS-PAGE, and the level of 77-binding 

protein found in the bead fraction quantitated from the gel using standard elcctrophoretic 
techniques such as described in the appended examples. 

Other techniques for immobilizing proteins on matrices are also available for use 
in the subject assay. For instance, either 77 or its cognate binding protein can be 

10 immobilized utilizing conjugation of biotin and streptavidin. For instance, biotinylated 
77 molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using 
techniques well known in the art (e.g., biotinylation kit. Pierce Chemicals, Rockford, 
1L), and immobilized in the wells of streptavidin-coated 96 well plates (Pierce 
Chemical). Alternatively, antibodies reactive with 77 but which do not interfere with 

1 5 binding of upstream or downstream elements can be derivatized to the wells of the plate, 
and 77 trapped in the wells by antibody conjugation. As above, preparations of a 77- 
binding protein and a test compound are incubated in the 77-presenting wells of the 
plate, and the amount of complex trapped in the well can be quantitated. Exemplary 
methods for detecting such complexes, in addition to those described above for the GST- 

20 immobilized complexes, include immunodetection of complexes using antibodies 

reactive with the TI binding element, or which are reactive with 77 protein and compete 
with the binding element; as well as enzyme-linked assays which rely on detecting an 
enzymatic activity associated with the binding element, either intrinsic or extrinsic 
activity. In the instance of the latter, the enzyme can be chemically conjugated or 

25 provided as a fusion protein with the TI binding protein. To illustrate, the 77 binding 
protein can be chemically cross-linked or genetically fused with horseradish peroxidase, 
and the amount of polypeptide trapped in the complex can be assessed with a 
chromogenic substrate of the enzyme, e.g. 3,3 f -diamino-benzadine tetrahydrochloride or 
4-chloro-l -napthol. Likewise, a fusion protein comprising the polypeptide and 

30 glutathione-S-transferase can be provided, and complex formation quantitated by 

detecting the GST activity using 1 -chIoro-2,4-dinitrobenzcne (Habig et al ( 1 974) J. Biol. 
Chem. 249:7130). 

For processes which rely on immunodetection for quantitating one of the 
proteins trapped in the complex, antibodies against the protein, such as anti-77 
35 antibodies, can be used. Alternatively, the protein to be detected in the complex can be 
"epitope tagged" in the form of a fusion protein which includes, in addition to the TI 
sequence, a second polypeptide for which antibodies are readily available (e.g. from 
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commercial sources). For instance, the GST fusion proteins described above can also be 
used for quantification of binding using antibodies against the GST moiety. Other 
useful epitope tags include myc-epitopes (e.g., see Ellison et al. (1991) J. Biol. Chem. 
266:21 150-21 157) which includes a 10-residue sequence from c-myc, as well as the 
pFLAG system (International Biotechnologies, Inc.) or the pEZZ-protein A system 
(Pharmacia, NJ). 

Cell Based Assay s 

In addition to cell-free assays, such as described above, the readily available 77 
proteins provided by the present invention also facilitates the generation of cell-based 
assays for identifying small molecule agonists/antagonists and the like. For example 
cells which are sensitive to apoptosis can be caused to overexpress a recombinant 77 ' 
protein in the presence and absence of a test agent of interest, with the assay scoring for 
modulation in 77 responses by the target cell mediated by the test agent. As with the 
cell-free assays, agents which produce a statistically significant change in 77-dependent 
responses (either inhibition or potentiation) can be identified. In an illustrative 
embodiment, the expression or activity of a 77 is modulated in embryos or cells and the 
effects of compounds of interest on the readout of interest (such as apoptosis) are 
measured. For example, the expression of genes which are up- or down-regulated in 
response to a 77-dependent signal cascade can be assayed. In preferred embodiments 
the regulatory regions of such genes, e.g., the 5' flanking promoter and enhancer regions 
are operably linked to a detectable marker (such as luciferase) which encodes a gene 
product that can be! readily detected. 

Further, the transgenic animals described herein may be used to generate cell 
lines, containing one or more cell types involved in a weight disorder, that can be used 
as cell culture models for diseases or disorders described herein. While primary cultures 
derived from transgenic animals of the invention may be utilized, the generation of 
continuous cell lines is preferred. For examples of techniques which may be used to 
denve a continuous cell line from the transgenic animals, see Small et al (1985) Mol 
30 Celt Biol. 5:642-648. 

In the event that the 77 proteins themselves, or in complexes with other proteins 
are capable of binding DNA and modifying transcription of a gene, a transcriptional 
based assay could be used, for example, in which a 77- responsive regulatory sequence 
is operably linked to a detectable marker gene. 

Monitoring the influence of compounds on cells may be applied not only in basic 
drug screening, but also in clinical trials. In such clinical trials, the expression of a panel 
of genes may be used as a "read out" of a particular drug's therapeutic effect 
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In yet another aspect of the invention, the subject 77 polypeptides can be used to 
generate a "two hybrid" assay (see, for example, U.S. Patent No. 5,283,3 1 7; Zervos et al. 
(1993) Cell 72:223-232; Madura et al. (1993) 1 Biol Chem. 268:12046-12054; Bartel et 
al. (1993) Biotechniques 14:920-924; Iwabuchi et al. (1993) Oncogene 8:1693-1696; 

5 and PCT Publication No. WO 94/10300), for isolating coding sequences for other 

cellular proteins which bind to or interact with 77, such as the C-terminus of tub, and the 
like. Briefly, the two hybrid assay relies on reconstituting in vivo a functional 
transcriptional activator protein from two separate fusion proteins. In particular, the 
method makes use of chimeric genes which express hybrid proteins. To illustrate, a first 

1 0 hybrid gene comprises the coding sequence for a DNA-binding domain of a 

transcriptional activator fused in frame to the coding sequence for a 77 polypeptide. The 
second hybrid protein encodes a transcriptional activation domain fused in frame to a 
sample gene from a cDNA library. If the bait and sample hybrid proteins are able to 
interact, e.g., form a 77-dependent complex, they bring into close proximity the two 

15 domains of the transcriptional activator. This proximity is sufficient to cause 

transcription of a reporter gene which is operably linked to a transcriptional regulatory 
site responsive to the transcriptional activator, and expression of the reporter gene can be 
detected and used to score for the interaction of the 77 and sample proteins. The use of 
the subject TI molecules in a three hybrid assay which allows for phosphorylation of the 

20 assay components, such as for example by the inclusion of sre, or the PDGF cytoplasmic 
domain is also provided for. 



Transgenic Animals 

These systems may be used in a variety of applications. For example, the cell- 
25 and animal-based model systems may be used to further characterize 77 genes and 
proteins. In addition, such assays may be utilized as part of screening strategies 
designed to identify compounds which are capable of ameliorating disease symptoms. 
Thus, the animal- and cell-based models may be used to identify drugs, pharmaceuticals, 
therapies and interventions which may be effective in treating disease. 

30 

Animal-Based Systems 

One aspect of the present invention concerns transgenic animals which arc 
comprised of cells (of that animal) which contain a transgene of the present invention 
and which preferably (though optionally) express an exogenous TI protein in one or 
35 more cells in the animal. A 77 transgene can encode the wild-type form of the protein, 
or can encode homologues thereof, including both agonists and antagonists, as well as 
antisensc constructs. In preferred embodiments, the expression of the transgene is 
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restricted to specific subsets of cells, tissues or developmental stages utilizing, for 
example, cis-acting sequences that control expression in the desired pattern. In the 
present invention, such mosaic expression of a 77 protein can be essential for many 
forms of lineage analysis and can additionally provide a means to assess the effects of, 
5 for example, lack of 77 expression which might grossly alter development in small 
patches of tissue within an otherwise normal embryo. Toward this and, tissue-specific 
regulatory sequences and conditional regulatory sequences can be used to control 
expression of the transgene in certain spatial patterns. Moreover, temporal patterns of 
expression can be provided by, for example, conditional recombination systems or 
1 0 prokaryotic transcriptional regulatory sequences. 

Genetic techniques which allow for the expression of transgenes can be regulated 
via site-specific genetic manipulation in vivo are known to those skilled in the art. For 
instance, genetic systems are available which allow for the regulated expression of a 
recombinase that catalyzes the genetic recombination a target sequence. As used herein, 
the phrase "target sequence" refers to a nucleotide sequence that is genetically 
recombined by a recombinase. The target sequence is flanked by recombinase 
recognition sequences and is generally either excised or inverted in cells expressing 
recombinase activity. Recombinase catalyzed recombination events can be designed 
such that recombination of the target sequence results in either the activation or 
repression of expression of one of the subject 77 proteins. For example, excision of a 
target sequence which interferes with the expression of a recombinant 77 gene, such as 
one which encodes an antagonistic homologue or an antisense transcript, can be 
designed to activate expression of that gene. This interference with expression of the 
protein can result from a variety of mechanisms, such as spatial separation of the 77 gene 
from the promoter element or an internal stop codon. Moreover, the transgene can be 
made wherein the coding sequence of the gene is flanked by recombinase recognition 
sequences and is initially transfected into cells in a 3' to 5" orientation with respect to the 
promoter element. In such an instance, inversion of the target sequence will reorient the 
subject gene by placing the 5' end of the coding sequence in an orientation with respect 
to the promoter element which allow for promoter driven transcriptional activation. 

The transgenic animals of the present invention all include within a plurality of 
their cells a transgene of the present invention, which transgene alters the phenotype of 
the "host cell" with respect to regulation of cell growth, death and/or differentiation. 
Since it is possible to produce transgenic organisms of the invention utilizing one or 
more of the transgene constructs described herein, a general description will be given of 
the production of transgenic organisms by referring generally to exogenous genetic 
material. This general description can be adapted by those skilled in the art in order to 
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incorporate specific transgene sequences into organisms utilizing the methods and 
materials described below. 

In an illustrative embodiment, either the crelloxP recombinase system of 
bacteriophage PI (Lakso et al. (1992) Proc. Natl Acad, ScL USA 89:6232-6236; Orban 
5 et al. (1 992) Proa Natl Acad. ScL USA 89:6861-6865) or the FLP recombinase system 
of Saccharomyces cerevisiae (O'Gorman et al . ( 1 99 1 ) Science 25 1:1351-1355; PCT 
Publication No. WO 92/15694) can be used to generate in vivo site-specific genetic 
recombination systems. Cre recombinase catalyzes the site-specific recombination of an 
intervening target sequence located between loxP sequences. loxP sequences are 34 

1 0 base pair nucleotide repeat sequences to which the Cre recombinase binds and are 

required for Cre recombinase mediated genetic recombination. The orientation of ioxP 
sequences determines whether the intervening target sequence is excised or inverted 
when Cre recombinase is present (Abremski et al. (1984) J. Biol. Chem. 259:1509- 
1514); catalyzing the excision of the target sequence when the loxP sequences are 

15 oriented as direct repeats and catalyzes inversion of the target sequence when loxP 
sequences are oriented as inverted repeats. 

Accordingly, genetic recombination of the target sequence is dependent on 
expression of the Cre recombinase. Expression of the recombinase can be regulated by 
promoter elements which are subject to regulatory control, e.g., tissue-specific, 

20 developmental stage-specific, inducible or repressible by externally added agents. This 
regulated control will result in genetic recombination of the target sequence only in cells 
where recombinase expression is mediated by the promoter element. Thus, the 
activation expression of a recombinant 77 protein can be regulated via control of 
recombinase expression. 

25 Use of the crelloxP recombinase system to regulate expression of a recombinant 

77 protein requires the construction of a transgenic animal containing transgenes 
encoding both the Cre recombinase and the subject protein. Animals containing both the 
Cre recombinase and a recombinant 77 gene can be provided through the construction of 
"double" transgenic animals. A convenient method for providing such animals is to 

30 mate two transgenic animals each containing a transgene. e.g., a 77 gene and 
recombinase gene. 

One advantage derived from initially constructing transgenic animals containing 
a 77 transgene in a recombinase-mediated expressible format derives from the likelihood 
that the subject protein, whether agonistic or antagonistic, can be deleterious upon 
35 expression in the transgenic animal. In such an instance, a founder population, in which 
the subject transgene is silent in all tissues, can be propagated and maintained. 
Individuals of this founder population can be crossed with animals expressing the 
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recombinase in, for example, one or more tissues and/or a desired temporal pattern. 
Thus, the creation of a founder population in which, for example, an antagonistic 77 
transgene is silent will allow the study of progeny from that founder in which disruption 
of 77 mediated induction in a particular tissue or at certain developmental stages would 
> result in, for example, a lethal phenotype. 

Similar conditional transgenes can be provided using prokaryotic promoter 
sequences which require prokaryotic proteins to be simultaneous expressed in order to 
facilitate expression of the 77 transgene. Exemplary promoters and the corresponding 
trans-activat.ng prokaryotic proteins are given in U.S. Patent No. 4,833,080. 

Moreover, expression of the conditional transgenes can be induced by gene 
therapy-like methods wherein a gene encoding the trans-activating protein, e.g. a 
recombinase or a prokaryotic protein, is delivered to the tissue and caused to be 
expressed, such as in a cell-type specific manner. By this method, a TV transgene could 
remain silent into adulthood until "turned on" by the introduction of the trans-activator 

In an exemplary embodiment, the "transgenic non-human animals" of the 
invention are produced by introducing transgenes into the germline of the non-human 
animal. Embryonal target cells at various developmental stages can be used to introduce 
transgenes. Different methods are used depending on the stage of development of the 
embryonal target cell. The specific line(s) of any animal used to practice this invention 
are selected for general good health, good embryo yields, good pronuclear visibility in 
the embryo, and good reproductive fitness. In addition, the haplotype is a significant 
factor. For example, when transgenic mice are to be produced, strains such as C57BL/6 
or FVB lines are often used (Jackson Laboratory, Bar Harbor, ME). Preferred strains are 
those with H-2b H-21 or H-2d haplotypes such as C57BL/6 or DBA/1 . The line(s) used 
to practice this invention may themselves be transgenics, and/or may be knockouts (i e 
obtained from animals which have one or more genes partially or completely 
suppressed) . 

In one embodiment, the transgene construct is introduced into a single stage 
embryo. The zygote is the best target for micro-injection. In the mouse, the male 
pronucleus reaches the size of approximately 20 micrometers in diameter which allows 
reproducible injection of l-2pl of DNA solution. The use of zygotes as a target for gene 
transfer has a major advantage in that in most cases the injected DNA will be 
incorporated into the host gene before the first cleavage (Brinster et al (1 985) Proc 
Natl. Acad. Sci. USA 82:4438-4442). As a consequence, all cells of the transgenic 
animal will carry the incorporated transgene. This will in general also be reflected in the 
efficient transmission of the transgene to offspring of the founder since 50% of the germ 
cells will harbor the transgene. 
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Normally, fertilized embryos are incubated in suitable media until the pronuclei 
appear. At about this time, the nucleotide sequence comprising the transgene is 
introduced into the female or male pronucleus as described below. In some species such 
as mice, the male pronucleus is preferred. It is most preferred that the exogenous 
5 genetic material be added to the male DNA complement of the zygote prior to its being 
processed by the ovum nucleus or the zygote female pronucleus. It is thought that the 
ovum nucleus or female pronucleus release molecules which affect the male DNA 
complement, perhaps by replacing the protamines of the male DNA with histones, 
thereby facilitating the combination of the female and male DNA complements to form 

1 0 the diploid zygote. 

Thus, it is preferred that the exogenous genetic material be added to the male 
complement of DNA or any other complement of DNA prior to its being affected by the 
female pronucleus. For example, the exogenous genetic material is added to the early 
male pronucleus, as soon as possible after the formation of the male pronucleus, which 

1 5 is when the male and female pronuclei are well separated and both are located close to 
the cell membrane. Alternatively, the exogenous genetic material could be added to the 
nucleus of the sperm after it has been induced to undergo decondensation. Sperm 
containing the exogenous genetic material can then be added to the ovum or the 
decondensed sperm could be added to the ovum with the transgene constructs being 

20 added as soon as possible thereafter. 

Introduction of the transgene nucleotide sequence into the embryo may be 
accomplished by any means known in the art such as, for example, microinjection, 
electroporation, or lipofection. Following introduction of the transgene nucleotide 
sequence into the embryo, the embryo may be incubated in vitro for varying amounts of 

25 time, or reimplanted into the surrogate host, or both. In vitro incubation to maturity is 
within the scope of this invention. One common method in to incubate the embryos in 
vitro for about 1 -7 days, depending on the species, and then reimplant them into the 
surrogate host. 

For the purposes of this invention a zygote is essentially the formation of a 
30 diploid cell which is capable of developing into a complete organism. Generally, the 
zygote will be comprised of an egg containing a nucleus formed, cither naturally or 
artificially, by the fusion of two haploid nuclei from a gamete or gametes. Thus, the 
gamete nuclei must be ones which are naturally compatible, i.e., ones which result in a 
viable zygote capable of undergoing differentiation and developing into a functioning 
35 organism. Generally, a euploid zygote is preferred. If an aneuploid zygote is obtained, 
then the number of chromosomes should not vary by more than one with respect to the 
euploid number of the organism from which either gamete originated. 
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In addition to similar biological considerations, physical ones also govern the 
amount (e.g., volume) of exogenous genetic material which can be added to the nucleus 
of the zygote or to the genetic material which forms a part of the zygote nucleus. If no 
genetic material is removed, then the amount of exogenous genetic material which can 
be added is limited by the amount which will be absorbed without being physically 
disruptive. Generally, the volume of exogenous genetic material inserted will not exceed 
about 10 picoliters. The physical effects of addition must not be so great as to physically 
destroy the viability of the zygote. The biological limit of the number and variety of 
DNA sequences will vary depending upon the particular zygote and functions of the 
exogenous genetic material and will be readily apparent to one skilled in the art, because 
the genetic material, including the exogenous genetic material, of the resulting zygote 
must be biologically capable of initiating and maintaining the differentiation and 
development of the zygote into a functional organism. 

The number of copies of the transgene constructs which are added to the zygote 
is dependent upon the total amount of exogenous genetic material added and will be the 
amount which enables the genetic transformation to occur. Theoretically only one copy 
is required; however, generally, numerous copies are utilized, for example, 1,000-20.000 
copies of the transgene construct, in order to insure that one copy is functional. As 
regards the present invention, there will often be an advantage to having more than one 
functioning copy of each of the inserted exogenous DNA sequences to enhance the 
phenotypic expression of the exogenous DNA sequences. 

Any technique which allows for the addition of the exogenous genetic material 
into nucleic genetic material can be utilized so long as it is not destructive to the cell 
nuclear membrane or other existing cellular or genetic structures. The exogenous genetic 
material is preferentially inserted into the nucleic genetic material by microinjection. 
Microinjection of cells and cellular structures is known and is used in the art. 

Reimplantation is accomplished using standard methods. Usually, the surrogate 
host is anesthetized, and the embryos are inserted into the oviduct. The number of 
embryos implanted into a particular host will vary by species, but will usually be 
30 comparable to the number of offspring the species naturally produces. 

Transgenic offspring of the surrogate host may be screened for the presence 
and/or expression of the transgene by any suitable method. Screening is often 
accomplished by Southern blot or Northern blot analysis, using a probe that is 
complementary to at least a portion of the transgene. Western blot analysis using an 
antibody against the protein encoded by the transgene may be employed as an alternative 
or add.t.onal method for screening for the presence of the transgene product. Typically 
DNA ,s prepared from tail tissue and analyzed by Southern analysis or PCR for the 
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transgene. Alternatively, the tissues or cells believed to express the transgenc at the 
highest levels are tested for the presence and expression of the transgene using Southern 
analysis or PCR, although any tissues or cell types may be used for this analysis. 

Alternative or additional methods for evaluating the presence of the transgene 

5 include, without limitation, suitable biochemical assays such as enzyme and/or 

immunological assays, histological stains for particular marker or enzyme activities, 
flow cytometric analysis, and the like. Analysis of the blood may also be useful to 
detect the presence of the transgene product in the blood, as well as to evaluate the effect 
of the transgene on the levels of various types of blood cells and other blood 

1 0 constituents. 

Progeny of the transgenic animals may be obtained by mating the transgenic 
animal with a suitable partner, or by in vitro fertilization of eggs and/or sperm obtained 
from the transgenic animal. Where mating with a partner is to be performed, the partner 
may or may not be transgenic and/or a knockout; where it is transgenic, it may contain 
] 5 the same or a different transgene, or both. Alternatively, the partner may be a parental 
line. Where in vitro fertilization is used, the fertilized embryo may be implanted into a 
surrogate host or incubated in vitro, or both. Using either method, the progeny may be 
evaluated for the presence of the transgene using methods described above, or other 
appropriate methods. 

20 The transgenic animals produced in accordance with the present invention will 

include exogenous genetic material. As set out above, the exogenous genetic material 
will, in certain embodiments, be a DNA sequence which results in the production of a 77 
protein (either agonistic or antagonistic), and antisense transcript, or a 77 mutant. 
Further, in such embodiments the sequence will be attached to a transcriptional control 

25 element, e.g., a promoter, which preferably allows the expression of the transgene 
product in a specific type of cell. 

Retroviral infection can also be used to introduce transgene into a non-human 
animal. The developing non-human embryo can be cultured in vitro to the blastocyst 
stage. During this time, the blastomcres can be targets for retroviral infection (Jaenich, 

30 R. (1976) Proc. Nati Acad. Sci USA 73:1260-1264). Efficient infection of the 
blastomeres is obtained by enzymatic treatment to remove the zona pellucida 
(Manipulating the Mouse Embryo. Hogan eds. (Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, 1 986). The viral vector system used to introduce the transgene is 
typically a replication-defective retrovirus carrying the transgene (Jahner et al. (1985) 

35 Proc. Natl. Acad Sci. 82:6927-693 1 ; Van der Putten et al. (1 985) Proc. Natl. Acad. Sci. 
USA 82:6148-6152). Transfection is easily and efficiently obtained by culturing the 
blastomeres on a monolayer of virus-producing cells (Van der Putten, supra', Stewart et 
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al. (1987) EM BO J. 6:383-388). Alternatively, infection can be performed at a later 
stage. Virus or virus-producing cells can be injected into the blastocoele (Jahner et al. 
(1 982) Nature 298:623-628). Most of the founders will be mosaic for the transgcnc since 
incorporation occurs only in a subset of the cells which formed the transgenic non- 
5 human animal . Further, the founder may contain various retroviral insertions of the 
transgene at different positions in the genome which generally will segregate in the 
offspring. In addition, it is also possible to introduce transgenes into the germ line by 
intrauterine retroviral infection of the midgestation embryo (Jahner et al., supra). 

A third type of target cell for transgene introduction is the embryonal stem cell 

1 0 (ES). ES cells are obtained from pre-implantation embryos cultured in vitro and fused 
with embryos (Evans et al. (1981) Nature 292:1 54-1 56; Bradley et al. (1984) Nature 
309:255-258; Gossler et al. (1986) Proc. Natl. Acad. Sci. USA 83:9065-9069; and 
Robertson et al. (1986) Nature 322:445-448). Transgenes can be efficiently introduced 
into the ES cells by DNA transfection or by retrovirus-mediated transduction. Such 

1 5 transformed ES cells can thereafter be combined with blastocysts from a non-human 
animal. The ES cells thereafter colonize the embryo and contribute to the germ line of 
the resulting chimeric animal. For review see Jaenisch, R. (1988) Science 240- 1468- 
1474. 

In one embodiment, gene targeting, which is a method of using homologous 
recombination to modify an animal's genome, can be used to introduce changes into 
cultured embryonic stem cells. By targeting a 77 gene of interest in ES cells, these 
changes can be introduced into the germlines of animals to generate chimeras. The gene 
targeting procedure is accomplished by introducing into tissue culture cells a DNA 
targeting construct that includes a segment homologous to a target 77 locus, and which 
25 also includes an intended sequence modification to the 77 genomic sequence (e.g., 
insertion, deletion, point mutation). The treated cells are then screened for accurate 
targeting to identify and isolate those which have been properly targeted. 

Gene targeting in embryonic stem cells is in fact a scheme contemplated by the 
present invention as a means for disrupting a 77 gene function through the use of a 
30 targeting transgene construct designed to undergo homologous recombination with one 
or more 77 genomic sequences. The targeting construct can be arranged so that, upon 
recombination with an element of a 77 gene, a positive selection marker is inserted into 
(or replaces) coding sequences of the targeted gene. The inserted sequence functionally 
disrupts the 77 gene, while also providing a positive selection trait. Exemplary 77 
35 targeting constructs are described in more detail below. 

Generally, the embryonic stem cells (ES cells) used to produce the knockout 
animals will be of the same species as the knockout animal to be generated. Thus for 
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example, mouse embryonic stem cells will usually be used for generation of knockout 
mice. 

Embryonic stem cells are generated and maintained using methods well known 
to the skilled artisan such as those described by Doetschman et al. (1985) J. Embryol 
5 Exp. Morphol. 87:27-45). Any line of ES cells can be used, however, the line chosen is 
typically selected for the ability of the cells to integrate into and become part of the germ 
line of a developing embryo so as to create germ line transmission of the knockout 
construct. Thus, any ES cell line that is believed to have this capability is suitable for use 
herein. One mouse strain that is typically used for production of ES cells, is the 129J 

10 strain. Another ES cell line is murine cell line D3 (American Type Culture Collection, 
catalog no. CKL 1934) Still another preferred ES cell line is the WW6 cell line (Ioffe et 
al. (1995) Proc. Natl. Acad, Sci. USA 92:7357-7361). The cells are cultured and 
prepared for knockout construct insertion using methods well known to the skilled 
artisan, such as those set forth by Robertson in: Teratocarci nomas and Embryonic Stem 

1 5 Cells: A Practical Approach, E.J. Robertson, ed. IRL Press, Washington, D.C., 1987); by 
Bradley et al. (1986) Current Topics in DeveL Biol. 20:357-371); and by Hogan et al. 
(Manipulating the Mouse Embryo: A Laboratory Manual, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1986) . 

Insertion of the knockout construct into the ES cells can be accomplished using a 

20 variety of methods well known in the art including for example, electroporation, 

microinjection, and calcium phosphate treatment. A preferred method of insertion is 
electroporation . 

Each knockout construct to be inserted into the cell must first be in the linear 
form. Therefore, if the knockout construct has been inserted into a vector (described 
25 infra), linearization is accomplished by digesting the DNA with a suitable restriction 
endonuclease selected to cut only within the vector sequence and not within the 
knockout construct sequence. 

For insertion, the knockout construct is added to the ES cells under appropriate 
conditions for the insertion method chosen, as is known to the skilled artisan. Where 
30 more than one construct is to be introduced into the ES cell, each knockout construct can 
be introduced simultaneously or one at a time. 

If the ES cells are to be clcctroporated, the ES cells and knockout construct DNA 
are exposed to an electric pulse using an electroporation machine and following the 
manufacturer's guidelines for use. After electroporation, the ES cells arc typically 
35 allowed to recover under suitable incubation conditions. The cells arc then screened for 
the presence of the knockout construct . 
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Screening can be accomplished using a variety of methods. Where the marker 
gene is an antibiotic resistance gene, for example, the ES cells may be cultured in the 
presence of an otherwise lethal concentration of antibiotic. Those ES cells that survive 
have presumably integrated the knockout construct. If the marker gene is other than an 
5 antibiotic resistance gene, a Southern blot of the ES cell genomic DNA can be probed 
with a sequence of DNA designed to hybridize only to the marker sequence. 
Alternatively, PCR can be used. Finally, if the marker gene is a gene that encodes an 
enzyme whose activity can be detected (e.g., b-galactosidase), the enzyme substrate can 
be added to the cells under suitable conditions, and the enzymatic activity can be 
1 0 analyzed. One skilled in the art will be familiar with other useful markers and the means 
for detecting their presence in a given cell. All such markers are contemplated as being 
included within the scope of the teaching of this invention. 

The knockout construct may integrate into several locations in the ES cell 
genome, and may integrate into a different location in each ES cell's genome due to the 
1 5 occurrence of random insertion events. The desired location of insertion is in a 

complementary position to the DNA sequence to be knocked out. e.g., the 7V coding 
sequence, transcriptional regulatory sequence, etc. Typically, less than about 1-5 % of 
the ES cells that take up the knockout construct will actually integrate the knockout 
construct in the desired location. To identify those ES cells with proper integration of 
the knockout construct, total DNA can be extracted from the ES cells using standard 
methods. The DNA can then be probed on a Southern blot with a probe or probes 
designed to hybridize in a specific pattern to genomic DNA digested with particular 
restriction enzyme(s). Alternatively, or additionally, the genomic DNA can be amplified 
by PCR with probes specifically designed to amplify DNA fragments of a particular size 
25 and sequence (i.e., only those cells containing the knockout construct in the proper 
position will generate DNA fragments of the proper size). 

After suitable ES cells containing the knockout construct in the proper location 
have been identified, the cells can be inserted into an embryo. Insertion may be 
accomplished in a variety of ways known to the skilled artisan, however a preferred 
30 method is by microinjection. For microinjection, about 1 0-30 cells are collected into a 
micropipette and injected into embryos that are at the proper stage of development to 
permit integration of the foreign ES cell containing the knockout construct into the 
developing embryo. For instance, as the appended Examples describe, the transformed 
ES cells can be microinjected into blastocysts. 

The suitable stage of development for the embryo used for insertion of ES cells 
is very species dependent, however for mice it is about 3.5 days. The embryos are 
obtained by perfusing the uterus of pregnant females. Suitable methods for 
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accomplishing this are known to the skilled artisan, and are set forth by, e.g., Bradley et 
al. (supra). 

While any embryo of the right stage of development is suitable for use, preferred 
embryos are male. In mice* the preferred embryos also have genes coding for a coat 
5 color that is different from the coat color encoded by the ES cell genes. In this way, the 
offspring can be screened easily for the presence of the knockout construct by looking 
for mosaic coat color (indicating that the ES cell was incorporated into the developing 
embryo). Thus, for example, if the ES cell line carries the genes for white fur, the 
embryo selected will carry genes for black or brown fur. 

10 After the ES cell has been introduced into the embryo, the embryo may be 

implanted into the uterus of a pseudopregnant foster mother for gestation. While any 
foster mother may be used, the foster mother is typically selected for her ability to breed 
and reproduce well, and for her ability to care for the young. Such foster mothers are 
typically prepared by mating with vasectomi/ed males of the same species. The stage of 

15 the pseudopregnant foster mother is important for successful implantation, and it is 
species dependent. For mice, this stage is about 2-3 days pseudopregnant. 

Offspring that are born to the foster mother may be screened initially for mosaic 
coat color where the coat color selection strategy (as described above, and in the 
appended examples) has been employed. In addition, or as an alternative, DNA from tail 

20 tissue of the offspring may be screened for the presence of the knockout construct using 
Southern blots and/or PCR as described above. Offspring that appear to be mosaics may 
then be crossed to each other, if they are believed to carry the knockout construct in their 
germ line, in order to generate homozygous knockout animals. Homozygotes may be 
identified by Southern blotting of equivalent amounts of genomic DNA from mice that 

25 are the product of this cross, as well as mice that are known heterozygotes and wild type 
mice. 

Other means of identifying and characterizing the knockout offspring are 
available. For example, Northern blots can be used to probe the mRNA for the presence 
or absence of transcripts encoding either the gene knocked out, the marker gene, or both. 

30 In addition, Western blots can be used to assess the level of expression of the 77 gene 
knocked out in various tissues of the offspring by probing the Western blot with an 
antibody against the particular 77 protein, or an antibody against the marker gene 
product, where this gene is expressed. Finally, in situ analysis (such as fixing the cells 
and labeling with antibody) and/or FACS (fluorescence activated cell sorting) analysis 

35 of various cells from the offspring can be conducted using suitable antibodies to look for 
the presence or absence of the knockout construct gene product. 
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Yet other methods of making knock-out or disruption transgenic animals are also 
generally known. See, for example. Manipulating the Mouse Embryo. (Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1 986). Recombinase dependent 
knockouts can also be generated, e.g. by homologous recombination to insert target 
sequences, such that tissue specific and/or temporal control of inactivation of a 77-gene 
can be controlled by recombinase sequences (described infra). 

Animals containing more than one knockout construct and/or more than one 
transgene expression construct are prepared in any of several ways. The preferred 
manner of preparation is to generate a series of mammals, each containing one of the 
desired transgenic phenotypes. Such animals are bred together through a series of 
crosses, backcrosses and selections, to ultimately generate a single animal containing all 
desired knockout constructs and/or expression constructs, where the animal is otherwise 
congemc (genetically identical) to the wild type except for the presence of the knockout 
construct(s) and/or transgene(s) . 

The present invention is further illustrated by the following examples which 
should not be construed as limiting in any way. The contents of all cited references 
(including literature references, issued patents, published patent applications as cited 
throughout this application are hereby expressly incorporated by reference. 

The practice of the present invention will employ, unless otherwise indicated 
conventional techniques of cell biology, cell culture, molecular biology, transgenic ' 
b.ology, microbiology, recombinant DNA, and immunology, which are within the skill 
of the art. Such techniques are explained fully in the literature. See, for example 
Molecular Cloning A Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch and 
Maniatis (Cold Spring Harbor Laboratory Press: 1989); DNA Cloning, Volumes I and II 
(D. N. Glover ed., 1 985); Oligonucleotide Synthesis (M. J. Gait ed., 1 984); Mullis et al 
U.S. Patent No: 4,683,195; Nucleic Acid Hybridization (B. D. Hames and S J Higgi ns 
eds.; Transcription And Translation (B. D. Hames & S. J. Higgins eds. 1984) Culture 
Of Animal Cells (R. I. Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells And 
Enzymes (IRL Press, 1986); B. Perbal„4 Practical Guide To Molecular Cloning (1984)- 
the treatise, Methods In Enzymology (Academic Press, Inc., N. Y.); Gene Transfer 
Vectors For Mammalian Cells (J. H. Miller and M. P. Calos eds., 1 987. Cold Spring 
Harbor Laboratory); Methods In Enzymology, Vols. 1 54 and 1 55 (Wu et al eds ) 
Immunochemical Methods In Cell And Molecular Biology (M a y C r and Walker eds 
Academic Press, London, 1987); Handbook Of Experimental Immunology Volumes I- 
IV (D. M. Weir and C. C. Blackwell, eds., 1986); Manipulating the Mouse Embryo 
(Cold Spring Harbor Laboratory Press, Cold Spring Harbor. N.Y.. 1986). 



WO 98/12302 



PCT/US97/15627 



-66- 



EXAMPLES 

Identification of Tub lnteractors 

The following materials and methods were used in the Examples: 

5 Yeast strains, Media, and Microbiological Techniques 

Standard yeast media including synthetic complete medium lacking L-lcucine, L- 
tryptophan, and L-histidine were prepared and yeast genetic manipulations were 
performed as described (Sherman (1 99 1)M?/ A. Enzymol 194:3-21). Yeast 
transformations were performed using standard protocols (Gietz et al. (1 992) NucL Acids 
10 Res. 20:1425. Ito et al, (1983) J. Bacterioi 153:163-168). Plasmid DNAs were isolated 
from yeast strains by a standard method (Hoffman and Winston (1987) Gene 57:267- 
272). 



Western Blotting 

15 A total protein extract of TBI 4 and TB20 was subjected to Western blotting 

analysis to confirm and qualitatively evaluate expression of the GAL4 DNA-binding 
domain TUB fusion proteins. The protein extract were prepared by growing TBI 4 and 
TB20 in synthetic complete medium lacking L-tryptophan (Sherman (1991) Meth. 
Enzymol 194:3) to an ODgQO °f 1- The yeast cells from 4.5 ml of culture were 

20 collected by centrifugation and the cell pellet was resuspended in 1 ml of 0.25 M NaOH 
1% beta-mercaptoethanol and incubated at 4°C for 10 minutes. 160 ml of 50% TCA 
were then added to the cell suspension and after mixing the suspension was incubated at 
4°C for 10 minutes. The suspension was then microfuged at 4°C for 10 minutes, the 
supernatant fraction was discarded, and the pellet was washed with cold acetone, air 

25 dried, and then resuspended in 1 20 ml of 2X tris-glycine SDS sample buffer (Novex, 
San Diego, CA) diluted to IX strength with deionized water. 

1 5^1 of the sample was boiled for 2 minutes and then electrophoresed on a 14% 
tris glycine SDS polyacrylamide gel (Novex) and then transferred to an immobilon 
PVDF membrane (Millipore; San Francisco, CA). The primary antibody utilized was a 

30 rabbit anti-yeast GAL4 DNA-binding domain polyclonal antibody (Upstate 

Biotechnology Inc., Lake Placid, NY) and the secondary antibody was a donkey anti- 
rabbit Ig, peroxidase linked species-specific whole antibody (Amersham Life Sciences, 
Cleveland, OH). Western blotting procedures were essentially as described (Sambrook 
et al. Molecular Cloning 2nd edition. Cold Spring Harbor Laboratory Press. 1989) and 

35 proteins interacting with the antibodies were visualized using the ECL detection system 
(Amersham Life Sciences, Cleveland, OH), essentially as described by the manufacturer. 
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Expression of the GAL4 DNA-binding domain TUB cytoplasmic domain fusion 
proteins were detected. 



15 



20 



Beta Galaclosidase Assays 

The f,lter disk beta-galactosidase (beta-gal) assay was performed essentially as 
previously described (Brill eial. (1994) Mrf. Biol. Cell. 5:297-312). Briefly strains to 
be tested were grown as patches of cells on appropriate medium dictated by the 
experiment at 30°C overnight. The patches or colonies of cells were replica plated to 
Whatman #50 paper disks (Schleicher & Schuell, #576; Keene, NH) that had been 
placed on the test medium in petri dishes. After growth overnight at 30°C, the paper 
disks were removed from the plates and the cells on them were permeabilized by 
.mmediately immersing them in liquid nitrogen for 30 seconds. After this treatment the 
paper d,sks were thawed at room temperature for 20 seconds and then placed in petri 
d.shes that contained a disk of Whatman #3 paper (Schleicher & Schuell #593 Keene 
NH) saturated with 2.5 ml of Z buffer containing 37 M I of 2»/ D weight per volume of the' 
chromogenic beta-gal substrate X-gal. The permeabilized strains on the paper disks 
were mcubated at 30°C and inspected at timed intervals for the blue color diagnostic of 
beta-gal activity in this assay. The assay was stopped by removing the paper disk 
containing the patches of cells and air drying it. 

Two Hybrid Screening and Identification of Tub Interactors 

Human TUB 184-506 and human TUB 1-506 were cloned into pGBT9 
(Clontech, Palo Alto, CA). The human TUB 1 84-506 was called pGBhTUB and the 
human TUB 1-506 clone was called pMB71. pGBhTUB and P MB71 were transformed 
mto two-hybrid screening strain HF7c. A pGBhTUB transformant was called TBI4 and 
a pMB71 transformant was called TB20. It was verified that neither human TUB 1 84- 
506 nor human TUB 1-506 activated the HIS3 or lacZ reporter genes present in HF7c 

TuZT^T ^ ^ TB2 ° ^ SUbjeCted ,0 ^ Wot Hu— 
TUB 184-506 was expressed at a high level and human TUB 1-506 was expressed at a 
30 very low level. 

Jn one experiment, TBM was transformed with a human prostate two-hybrid 
horary and 20 million transforms were obtained and in another experiment TB 1 4 was 
transformed with a mouse T-ce.l library and 10 million transforms were obtained 
TB20 was transformed with a human prostate two-hybrid library and 1 5 million 

IhTlTnrT By TVB -^ P»-rs, * was determined 

that the TUB cDNA was present in these libraries. Transforms were plated on 
synthenc complete medium lacking leucine, tryptophan, and histidine to select for 
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transformants expressing cDNA library plasmids encoding TUB-interacting proteins. 
All colonies that grew on the selective plates were analyzed for beta-galactosidase 
expression using the filter bcta-galactosidase assay and the strongest beta-galactosidase 
expressing plasmids from each screen were analyzed. 
5 In the screen where TBI 4 was transformed with a human prostate two-hybrid 

library, E. coli plasmid ptyhq058; £. coli plasmid ptyhq054 and £ coli plasmid 
ptyhq036 were identified. In the screen where TB14 was transformed with a mouse T- 
cell library, £ coli plasmid ptyhtlOl and £. coli plasmid ptyht!02, the mouse 
homologues of £ coli plasmid ptyhq036 and E. coli plasmid ptyhq054 were identified. 

10 In the screen where TB20 was transformed with the human prostate library, £ coli 
plasmid ptyhq049 and human serine palmitoyl transferase (GenBank Accession No. 
U 15555) were identified. Human serine palmitoyl transferase is a weak interactor 
because it activates the HI S3 reporter gene but not the lacZ gene, at least not enough to 
be detected in the assays. £ coli plasmid ptyhq058 appeared to be the strongest 

15 interactor. All seven of these interactors bind to full length human and mouse TUB and 
the carboxyl-terminus of human and mouse TUB. In addition, none of these interactors 
bind to the carboxyl-terminus of human and mouse TUB missing the final 44 amino 
acids, amino acids lacking in the mutated mouse TUB gene. These seven interactors 
were found to not bind to several test proteins showing that they bind specifically to 

20 TUB. 



Northern Analysis 
Methods 

Total RNA was isolated from various mouse (C57BL/6 wild type and tub/tub) 
25 tissues using RNAzol B (Tel-Test, Inc., Friendswood, Texas). Poly A+ RNA was 
isolated from a variety of human and mouse cell lines using the FastTrack system 
(Invitrogen, San Diego, CA). Extracted RNA was electrophoresed through a 
formaldehyde gel, transferred to Genescreen nylon membrane (NEN Research Products, 
Boston, MA) and cross-liked using a Stratalinkcr apparatus (Stratagene, La Jolla, CA). 
30 For probing northern blots, 50 ng of the following probes were labelled using 

Prime-It (Stratagene, La Jolla, CA): human ank; human tpr; human ring; mouse tpr; or 
mouse ring. Blots were hybridized at 65°C in Church Buffer overnight and washed in 
0.2X SSC/0. 1% SDS also at 65°C. Filters were exposed to film (X-omat AR. Kodak) 
for 1 8-36 hours. 
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Human Tissue Results 



Human multiple tissue northern blots (Clontech, Palo Alto, CA) were probed 
The human tissues tested included: spleen, thymus, prostate, testes, uterus small 
intestine, colon, peripheral blood leukocytes, heart, brain, placenta, lung, liver, skeletal 
muscle, kidney, pancreas and retina. 

Bands of approximately 2.4 kb and 1 0 kb were found to be ubiquitously 
expressed in all tissues tested using the human ank probe. The 2.4 kb band in retinal 
tissue gave an increased signal. Using the human ring probe, bands of 1 3kb and 2 kb 
were expressed in all tissues tested. The 2 kb band gave an increased signal in retinal 
tissue. Hybndization with the human ring probe yielded bands of 3kb and 4kb in all 
tissues tested. An additional band of 1 .4 kb was detected in testes. 

Mouse Tissue Results 

Mouse tissues were obtained from C57BL/6 and tub/tub animals. Tissues used 
were: brain, hypothalamus, liver, heart, spleen, stomach, kidney, muscle, fat, and testes 
Neither the human ring probe nor the human ank probe yielded any signal in any tissue 
tested, the mouse tpr probe hybridized with a 1 .4 kb band in C57BL/6 testes and a 1 4 
kb band m tub/tub brain and testes. The mouse ring probe hybridized with a 2.4 kb and 
a 3.0 kb band m all tissues tested from both strains of mice and also hybridized with a 
20 1 .4 kb band from testes tissue from C5 7BL/6 and tub/tub mice. 

Cell Line Results 

Poly A+ RNA was isolated from a variety of ATCC cell lines (including human 
cell lines SHEP; SHSY5Y; SKNMC (neuroblastoma); SKNSH- Neuro 2A 
(neuroblastoma), NB412A/8; the human breast carcinoma cell line MCF7 and the mouse 
fibroblast cell line NIH 3T3). The human ank probe hybridized with a 2.3 kb band in 
«he SHEP, SHSY5Y, SKNMC, SKNSH, and MCF7 cell lines. The same human ank 
probe l,t up a 2 kb band in Neuro 2A and NB412A/8 cells. No signal was detected in 
the 3T3 cell line. The human tpr probe hybridized with a 2 kb band in all cell lines 
tested. An additional band of 4.4 kb was detected using this probe in the neuro 2A cells 
The human nng probe detected a 2.4 kb band in the SHEP, SIISY5Y. SKNMC and 
SKNSH cell lines. No signal was detected in any other of the cell lines using the rin E 
probe. b 
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Deposit of Microorganisms 

£. coli plasmid ptyhq049 was deposited with the American Type Culture 
Collection Rockville, MD, on August 6 1996 under the terms of the Budapest Treaty and 
assigned Accession Number 98125 (hTI-1). 
5 £ coli plasmid ptyhq058 was deposited with the American Type Culture 

Collection Rockville, MD, on August 6 1996 under the terms of the Budapest Treaty and 
assigned Accession Number 98127 (hTI-2). 

£ colt plasmid ptyhq036 was deposited with the American Type Culture 
Collection Rockville, MD, on August 6 1996 under the terms of the Budapest Treaty ar 
1 0 assigned Accession Number 98 1 28 (hTI-3). 

£ coli plasmid ptyhq054 was deposited with the American Type Culture 
Collection Rockville, MD, on August 6 1996 under the terms of the Budapest Treaty and 
assigned Accession Number 98126 (hTI-4). 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



10 



15 



20 



(i) APPLICANT: 

(A) NAME: MILLENNIUM PHARMACEUTICALS INC 

(B) STREET: 640 MEMORIAL DRIVE 

(C) CITY : CAMBRIDGE 

(D) STATE: MASSACHUSETTS 

(E) COUNTRY: US 

(F) POSTAL CODE (ZIP) ; 02139 

(G) TELEPHONE: 

(H) TELEFAX: 

(ii) TITLE OF INVENTION : WEIGHT CONTROL PATHWAY GENES AND USES 

THEREFOR 

(iii) NUMBER OF SEQUENCES: 36 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LAHIVE & COCKFIELD, LLP 

(B) STREET: 28 STATE STREET 
? <C) CITY: BOSTON 

5 < D > STATE: MASSACHUSETTS 

<E) COUNTRY: US 
(F) ZIP: 02109 



30 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
<B) COMPUTER: IBM PC compatible 
(O OPERATING SYSTEM; PC-DOS /MS -DOS 
(D) SOFTWARE: Patentm Release #i.o, Version #1.25 

35 < vi > CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US97 / 

(B) FILING DATE: 

(C) CLASSIFICATION: 

40 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/897 34 0 

(B) FILING DATE: 21 JULY 1997 

(C) APPLICATION NUMBER: US 08/715,032 
45 (D > FILING DATE: 17 SEPTEMBER 1996 

Cviii) ATTORNEY/ AGENT INFORMATION : 

(A) NAME: SILVERI, JEAN M. 

(B) REGISTRATION NUMBER: 3 9,030 
(O REFERENCE/DOCKET NUMBER: MNI- 005CPPC 



50 



55 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617)227-7400 

(B) TELEFAX ; (617)742-4 214 
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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 86 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



10 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



15 


GAATTCGGCA 


CGAGCGCACT 


CGCAGCCCTG 


GCAGGCGGCA 


CTGGTCATGG 


AAAACGAATT 


60 




GTTCTGCTCG 


GGCGTCCTGG 


TGCATCCGCA 


GTGGGTGCTG 


TCAGCCGCAC 


ACTGTTTCCA 


120 


ISJ 


GAAGTGAGTG 
AGAGCCAGGG 


CAGAGCTCCT 
AGCCAGATGG 


ACACCATCGG 
TGGAGGCCAG 


GCTGGGCCTG 
CCTCTCCGTA 


CACAGTCTTG 
CGGCACCCAG 


AGGCCGACCA 
AGTACAACAG 


180 
240 




ACCCTTGCTC 


GCTAACGACC 


TCATGCTCAT 


CAAGTTGGAC 


GAATCCGTGT 


CCGAGTCTGA 


300 


25 


CACCATCCGG 


AGCATCAGCA 


TTGCTTCGCA 


GTGCCCTACC 


GCGGGGAACT 


CTTGCCTCGT 


360 




TTCTGGCTGG 


GGTCTGCTGG 


CGAACGGCAG 


AATGCCTACC 


GTGCTGCAGT 


GCGTGAACGT 


420 


30 


GTCGGTGGTG 
GTTCTGCGCC 


TCTGAGGAGG 
GGCGGAGGGC 


TCTGCAGTAA 
AAGACCAGAA 


GCTCTATGAC 
GGACTCCTGC 


CCGCTGTACC 
AACGGTGACT 


ACCCCAGCAT 
CTGGGGGGCC 


480 
540 




CCTGATCTGC 


AACGGGTACT 


TGCAGGGCCT 


TGTGTCTTTC 


GGAAAAGCCC 


CGTGTGGCCA 


600 


35 


AGTTGGCGTG 


CCAGGTGTCT 


ACACCAACCT 


CTGCAAATTC 


ACTGAGTGGA 


TAGAGAAAAC 


660 




CGTACCAGGC 


CAGTTAACTC 


TGGGGACTGG 


GAACCCATGA 


AATTGACCCC 


CAAATACATC 


720 


40 


CTGCGGAAGG 
AGGCCCCCAG 


AATTCAGGAA 
CCCCTCCTCC 


TATCTGTTCC 
CTCAAACCAA 


CAGCCCCTCC 
GGGTACAGAT 


TCCCTCAGGC 
CCCCAGCCCC 


YCAGGAGTCC 
TCCTCCCTCA 


780 
840 




GACCCAGGAG 


TCCAGACCCC 


CCAGCCCCTC 


CTCCCTCAGA 


CCCAGGAGTC 


CAGCCCCTCC 


900 


45 


TCCCTCAGAC 


CCAGGAGTCC 


AGACCCCCCA 


GCCCCTCCTC 


CCTCAGACCC 


AGGGGTCCAG 


960 




CCTCTCCTCC 


CTCAGACCCA 


GGAGTCCAGA 


CCCCCCAGCC 


CCTCCTCCCT 


CAGACCCAGG 


1020 


50 


AGTCCAGCCC 


CTCCTCCCTC 


AGACCCAGGA 


GTCCAGATCC 


CCCAGCCCCT 


CCTCCCTCAG 


1080 


ACCCAGGGGT 


CCAGGCCCCC 


AACCCCTCCT 


CCCTCAGACT 


CAGAGGTCCA 


AGCCCCCAAC 


1140 




CCCTCCTTCC 


CCAGACCCAG 


AGGTCCAGGT 


ACCAGCCCCT 


CCTCCCTCAG 


ACCCAGCGGT 


1200 


55 


CCAATGCCAC 


CTATACTCTC 


CCTGTACANA 


TTGCCNC CTT 


GTGGCACGTT 


GACCCAACCT 


1260 
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TACCAGTTGG TTTTTCATTT TTTGTCCCTT TCCCCTAGAT CCAGAAATAA AGTTTAAGRG 1320 
RAGSGCCAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAACYCG 1380 
AGAANT 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2103 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



1386 



30 



40 



60 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GGATCCGAAT TCGGCACGAG GCGGAGGGAA GTAGGTCCGT TGGTCGGTCG GGAACGAGGC 

^ TCAGGCGGCC AGGCCCGCGC GGAGCCGTTG CCATGGCAG C CGCCGCCGGG GACGCGGACG 120 

ACGAGCCGCG CTCAGGCCAC TCGAGCTCGG AGGGCGAGTG CGCGGTGGCG CCGGAGCCGC 180 

TGACTGACGC TGAGGGCCTC TTCTCCTTCG CTGACTTCGG GTCTGCGCTG GGCGGCGGCG 240 

GCGCGGGCCT CTCGGGCCGG GCGTCCGGCG GGGCCCAGTC GCCGCTGCGC TACTTGCACG 300 

TCCTGTGGCA GCAGGATGCG GAGCCGCGCG ACGAGCTGCG CTGCAAGATA CCCGCTGGCC 360 

^ GGCTGAGGCG CGCTGCCAGG CCCCACCGGC GGCTCGGGCC CACGGGCAAG GAGGTGCACG 420 

CTCTGAAGAG ACTGAGGGAC TCGGCCAATG CCAATGATGT GGAAACAGTG CAGCAGCTGC 480 

TGGAAGATGG CGCGGATCCC TGTGCAGCTG ATGACAAGGG CCGCACAGCT CTACACTTTG 540 

CCTCATGCAA TGGCAATGAC CAGATTGCTG CTCCTGGACC ATGGTGCTGA TCCTAACCAG 600 

CGAGATGGGC TGGGGAACAC GCCACTGCAC CTGGCGGCCT GCACCAACCA CGTTCCTGTC 660 

^ ATCACCACAC TGCTACGAGG AGGGGCCCGT GTAGATGCCC TGGACCGAGC TGGTCGCACA 720 
CCCCTGCACC TGGCCAAGTC AAAGCTGAAT ATCCTGCAGG AGGGCCATGC CCAGTGCCTA 
GAGGCTGTGC GTCTGGAGGT GAAGCAGATC ATCCATATGC TGAGGGAGTA TCTGGAGCGC 
50 CTAGGGCAAC ATGAGCAGCG AGAACGCCTG GATGACCTCT GCACCCGCCT G CAG ATG ACC 
AGTACCAAAG AGCAGGTGGA TGAAGTGACT GACCTCCTGG CCAGCTTCAC CTCCCTCAGT 
^ CTGCAGATGC AGAGCATGGA GAAGAGGTAG CAAGAGAGGC TCCCTGCCTT CCTGCCACTG 1020 



780 
840 
900 
96 0 
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CCCCACCCTG CCCCACTGCT GTCTCAGTAC CAAGAAAAAG CCCAACATCT GGGACTTGGA 1080 

GCTGCACTTG TCTGGTGAGG ACCTTGCCCT CACCCGCACA TGCCGTGGGG CAGAGATGCT 114 0 

5 CTCTCTCCAC GGCCTCAGAG CCACTCCCAG CCACAGTTTC CAGCATCTCT GTGGACAGGG 1200 

ACCACAGCTC CCAGCTTCTT CCAGTTCTCG CAGCACCAGA CCAGCCTCTG CAGCTGCACT 1260 

TCAGCTCCGC AGACCTGCGC TATCTCAGCA GACCTCACTT GCCCCATGGC CTTCATGGCG 1320 

10 

CGCTCCAGGC CTCAGACCCT TCTCTGTGTT CCGTCCTGGC CATGGGCTTG TTGCAGTCAG 13 BO 

CAGGTGTGGG CTTAGGCGGG CACCCTGTGG CCAGGGGTAC TGCGTGAGGC CCTCAGTTGG 144 0 

15 TCCTGTGCCT CTCACCAGCA CTTAGACAGA CACGTCACCA GACTTTCAAG GAGATACTGC 1500 

AGTGAGTTTC TCTGGTTGGA AGGGGAGGGT TGGTGAGTCC CAGACCTTAA AAATACAAGG 1560 

TTAAGAGGGA CCCCAAAGCA AAAAATTCCA ACCCTTTTCC TCCCAGTCAT TGAAACACCA 1620 

20 

AAACTATTAT ACCGGAGGGT GTAATAGTTT TGCTGCCCAG TTGTG GTAGG CCAGTAGTGG 1680 

CCTCCCAAGA TGCCCATGTC CTAATCCCAG GAACCTGTCA AAATTACCTT GTATGGCCAA 1740 

25 AGGGGCTTTG CAGATGTAAT GAAGTTAAGG ATCTTTCGCC AGGAAGATTA TCCCAGCTTG 1800 

TTCAGGAGGG CTTGATGTCC TCACCCGGGT CTGTATAACA GAAGAGCAGG TGACGGGAGA 1860 

GGAGGTTGGA GGTGTAGCGA TGGAGCAGGA AACTGGAGTT GAGGAGGGCA GCTCAAGCCA 1920 

30 

CAGAGTCCAG GCCACCTCAG AGCCAGGAAA TGCATCCTCC CACAGAGCCC TGGAAGGCCC 1980 

CAGCCCTGCT CCCACCTGGA CTGGCTCAGT GAGGCTAATT TTATAATTCT GGCTGATTTT 204 0 

35 AGAACTCTAA GGGAATAAAT TTGTGTTGTT TTAAGTCAAA AAAAAAAAAA AAAAAAACTC 2100 

GAG 2 ^-°3 
(2) INFORMATION FOR SEQ ID NO: 3: 

40 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
AATTCGGCAC GAGAAAAATG CTAGCTATTA TGGTAATCGA GCAGCCACCT TGATGATGCT 60 
55 TGGAAGGTTC CGGGAAGCTC TTGGAGATGC ACAACAGTCA GTGAGGTTGG ATGACAGTTT 120 
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10 



180 
240 
300 
360 
420 



TGTCCGGGGA CATCTACGAG AGGGCAAGTG CCACCTCTCT CTGGGGAATG CCATGGCAGC 
ATGTCGCAGC TTCCAGAGAG CCCTAGAACT GGATCATAAA AATGCTCAGG CACAACAAGA 
GTTCAAGAAT GCTAATGCAG TCATGGAATA TGAGAAAATA GCAGAAACAG ATTTTGAGAA 
GCGAGATTTT CGGAAGGTTG TTTTCTGCAT GGACCGTGCC CTAGAATTTG CCCCTGCCTG 
CCATCGCTTC AAAATCCTCA AGGCAGAATG TTTAGCAATG CTGGGTCGTT ATCCAGAAGC 
ACAGTCTGTG GCTAGTGACA TTCTACGAAT GGATTCCACC AATGCAGATG CTCTGTATGT 480 
^ ACGAGGTCTT TGCCTTTATT ACGAAGATTG TATTGAGAAG GCAGTTCAGT TTTTCGTACA 5 40 
GGCTCTCAGG ATGGCTCCTG ACCACGAGAA GGCCTGCATT G CCTGCAGAA ATGCCAAAGC 600 
ACTCAAAGCA AAGAAAGAAG ATGGGAATAA AGCATTTAAG GAAGGAAATT ACAAACTAGC 660 
ATATGAACTG TACACAGAAG CCCTGGGGAT AGACCCCAAC AATATAAAAA CAAATGCTAA 7 20 
ACTCTACTGT AATCGGGGTA CGGTTAATTC CAAGCTTAGG AAACTAGATG ATGCAATAGA 780 
25 AGACTG CACA AATGCAGTGA AGCTTGATGA CACTTACATA AAAGCCTACT TGAGAAGAGC 840 
TCAGTGTTAC ATGGACACAG AACAGTATGA AGAAGCAGTA CGAGACTATG AAAAAGTATA 900 
CCAGACAGAG AAAACAAAAG AACACAAACA GCTCCTAAAA AATGCGCAAC TTAAGTTTAG 960 
AAATTACAAG TTTCAGTAAT AGCTGAACCT GTTCAAAATG TTAATAAAGG TTTCGTTGCA 1020 
TGGTAGCATA AAAAAAAAAA AAAAAAAA 
(2) INFORMATION FOR SEQ ID NO: 4: 



20 



30 



35 



40 



45 



1048 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1700 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

TCGAGATTTA CCCATAGATA TGTGTCCTAA CAATGCCAGC TATTACGGTA ATCGAGCGGC 
50 CACACTGATG ATGCTTGGAC GGTTCCGGGA AGCTCTTGGA GATGCGCAGC AGTCTGTGAG 120 

GTTGGATGAC AGTTTTGTCC GGGGACACCT CCGAGAAGGC AAGTGCCACC TCTCACTTGG 180 
^ GAATGCAATG GCGGCATGTC GTAGTTTCCA AAGAGCCCTA GAACTGGATC ATAAAAATGC 240 
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CCAGGCACAG CAGGAGTTCA AGAACGCCAA TGCCGTCATG GAGTATGAGA AAATAGCAGA 300 

AGTGGATTTT GAAAAGCGAG ATTTCCGGAA GGTTGTTTTC TGCATGGACC GTGCCCTAGA 36 0 

5 ATTTGCCCCT GCCTGCCATC GATTCAAAAT TCTCAAAGCA GAATGTTTAG CAATGCTTGG 4 20 

TCGATACCCA GAAGCACAGT TTGTGGCCAG TGACATTTTA CGAATGGATT CCACCAATGC 480 

TGATGCTCTG TATGTCCGGG GTCTTTGCCT TTATTACGAA GATTGTATTG AGAAGGCAGT 540 

10 

GCAGTTTTTT GTACAGGCTC TCAGGATGGC TCCTGACCAC GAGAAGGCTT GTGTCGCTTG 600 

TAGAAATGCC AAAGCCCTTA AAGCCAAGAA GGAAGATGGG AATAAAGCCT TTAAGGAAGG 66 0 

15 AAATTACAAG CTAGCATATG AACTGTACAC AGAAGCCTTG GGGATAGATC CCAACAACAT 72 0 

AAAAACAAAT GCTAAACTCT ACTGTAATCG GGGTACGGTT AATTCCAAGC TTAGGCAACT 780 

GGAAGATGCC ATAGAAGACT GTACAAATGC GGTGAAGCTC GATGACACTT AC AT C AAAG C 84 0 

20 

CTACCTGAGA AGAGCTCAGT GTTACATGGA CACAGAGCAG TTTGAAGAAG CCGTGCGGGA 900 

CTATGAAAAA GTGTATCAGA CGGAGAAAAC AAAAGAACAC AAACAGCTCC TTAAGAATGC 96 0 

25 ACAGCTGGAA CTGAAGAAGA GCAAGAGGAA AGAT TACT AC AAGATCCTGG GAGTGGACAA 1020 

GAATGCCTCT GAGGACGAGA TCAAGAAAGC TTACCGGAAA CGGGCCTTGA TGCACCATCC 1080 

AGATCGGCAC AGTGGGGCCA GTGCCGAAGT TCAGAAGGAG GAGGAGAAGA AGTTTAAGGA 1140 

30 

AGTGGGAGAG GCCTTTACCA TCCTCTCTGA TCCCAAGAAA AAGACTCGTT ATGACAGTGG 1200 

ACAGGACTTG GATGAGGAGG GCATGAATAT GGGCGATTTT GATGCAAACA ACATCTTCAA 1260 

35 GGCATTCTTC GGTGGTCCTG GGGGCTTCAG CTTTGAAGCA TCTGGCCCAG GGAATTTCTA 1320 

CTTTCAGTTT GGCTAATGAA GGCCAACTAC TTAAAACCCA GAAAATGCAG ACTTG CTTGG 1380 

TTTAACCATG AGTGTGGACA GTTCACTTCC TCCATCATGT CCCTGTGTAC TTATAG CAGT 144 0 

40 

NTCGTTTTCT CAGTCGGGTG CCCTGTGTCT GTATGAGGGG TGAAKGAAAG GGGGCCAGTG 1500 

CTGAGGACTA GGGAGGGATG GAAGCCANGG GTAKACAGGG AAGCAGGCAG CTTGTGAATT 156 0 

45 TTTGTTGTAT TGTTTAACTT TATTAAAAAA GAAAAACAAT ACTGTAAAWT WTAAAAAGGA 1620 

AAAGRATTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 16 80 

AAAAAAAAAA AAAGGTAAAT 1700 

50 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 8 base pairs 
55 (B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO.-5: 
10 ACGAGCGGTG ACGGCCGGGT AGGCTGTAGG CAGCGCAATG CCAAGACAGA GCTGCTGGCG 60 
GCGGCGGGCG AATCTCCCTG CACCATGAGC CTCGGCTCCG GCCCCGTTAG GGGCCGATAA 120 
js GCACAGCGCA CGCCGCCCTC CATTTGCCCC GGGGCCTCGG CTGCGAAGAT AGCGGCGGCC 180 
GGACAGGAAG CTCGAGGAAA GCGCTGGGCC GGGTCTCTAC GAACACGTGA AGGAAAAGCA 240 
GCTCCGTCCA CAACGCCGCT TCGGGGCTCC TAGGGAGTCG GGCCCCGGGC CGCCACCGTC 300 
20 ACCTCCGGCC GCTGCCGCTG TCGCCATCGC CTTGTTTCCC CATCCCCCGC CATGGCCGAG 360 
GACCTCTCTG CGGCCACGTC CTACACCGAA GATGATTTCT ACTGCCCCGT CTGTCAGGAG 420 
^ GTGCTCAAAA CGCCCGTGCG GACCACGGCC TGTCAGCACG TTTTCTGTAG AAAATGTTTC 480 
CTGACTGCAA TGAGGGAAAG CGGAGCACAT TGTCCCCTAT GTCGTGGAAA TGTGACTAGA 540 
AGAGAGAGAG CATGTCCTGA ACGGGCCTTA GACCTTGAAA ATATAATGAG GAAGTTTTCT 6 00 
30 GGTAGCTGCA GATGCTGTGC AAAACAGATT AAATTCTATC GCATGAGACA TCATTACAAA 660 
TCTTGTAAGA AGTATCAGGA TGAATATGGT GTTTCTTCTA TCATTCCAAA CTTTCAGATC 720 
^ TCTCAAGATT CAGTAGGGAA CAGCAATAGG AGTGAAACAT CCACATCTGA TAACACAGAA 780 
ACTTACCAAG AGAATACAAG TTCTTCTGGT CATCCTACTT TTAAGTGTCC CCTGTGTCAA 840 
GAATCAAATT TTACCAGACA GCGTTTACTG GATCACTGTA ACAGTAATCA CCTATTTCAG 900 
40 ATAGTTCCTG TGACATGTCC TATTTGTGTG TCTCTTCCTT GGGGAGATCC TAGCCAGATT 960 
ACCAGAAATT TCGTTAGTCA TCTAAATCAG AGACATCAAT TTGATTATGG AGAATTTGTG 1020 
^ AATCTTCAGC TAGATGAAGA AACCCAATAC CAAACTGCTG TTGAAGAATT TTTTCAAGTA 1080 
AACATTTGAA GGCTGTAGAC ATTTTTGCAT TTTTGTACCT GCAAGTGCCA TCTTTAAGGG 1140 
GGAAAMTACA TGAAGTCACC GTTACAGTAA CTTGATGTGT ATATTAATAA AAGTAATTCA 1200 
50 GTCMAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 1248 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
55 <A> LENGTH: 2121 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

10 

AGTTCACCTA CCACCACCAC CTCGGCTCCT GCCGGCGCCG TCGCCTCTCC CGCCCACCCC 60 

TCGCCATGTC CGAGGAACTT TCGGCGGCCA CGTCCTACAC GGAAGATGAT TTCTACTGCC 120 

15 CTGTCTGTCA GGAGGTGCTC AAGACGCCGG TGCGGACCGC GGCCTGTCAG CACGTTTTCT 180 

GTAGAAAATG TTTC CTGACT GCAATGAGAG AAAGTGGAAT ACATTGTCCC CTATGTCGTG 240 

GAAGTGTGAC TAGAAGAGAA AGAGCATGTC CGGAACGGGC CTTAGATCTT GAAAATATCA 300 

20 

TGAGGAGGTT TTCTGGTAGC TGCAGATGCT GTTCAAAAAA GATTAAATTC TATCGCATGA 360 

GACATCATTA CAAATCTTGT AAGAAGTATC AGGATGAATA TGGTGTTTCT TCTGTCATTC 420 

25 CAAACTTTAA GATTTCTCAA GATTCAGTAA GGAGCAGTAA TAGGAGTGAA ACATCTG CAT 480 

CTGATAACAC AGAAACTTAT CAAGAGGATA CAAGTTCTTC TGGGCATCCT AC C TTTAAGT 540 

GTCCCTTATG TCAAGAGTCA AAT TTCACCA GACAACGTTT ATTGGATCAC TGTAATAGTA 600 

30 

ACCACCTATT TCAGATAGTT CCTGTGACAT GTCCTATTTG TGTGTCTCTT CCTTGGGGAG 660 

ATCCTAGCCA GATTACTAGA AATTTCGTTA GTCATCTAAA TCAAAGACAT CAGTTTGATT 720 

35 ATGGAGAATT TGTGAATCTT CAGCTAGATG AGGAAACCCA ATATCAAACT GCTGTGGAAG 780 

AGTCTTTTCA AGTAAACATG TGACATGTAT AGACATCTCT GCCTCCTTGC AACCTACAAG 840 

TGCCATCTTT AAGGAGAAGA CATGAAGTCA CCATTTTCAG TAATTTGCTG TGCATATTAA 900 

40 

TAAAAATAAT AATTCAGTCT ACTGTATTAG GTTTTTAATT GAAAATAAAG GTGGGCCACC 960 

CTAATACCAT TCTCTAGACA GTTACTTTAA CAGCATGGAA AGGGTTGTAT TTCACTTGTG 1020 

45 TGGTGAAAAG AGAATCTCTG TTGTCTTTTT CTTCCTTGTA TTACATATTC TCAATGTTTC 1080 

ATTAAGTTGT TTTTGGTATT TG ATATAGTT CCTTCTGTTT AGTACAGAGA TAACAGCAAA 1140 

TTCTGAACGA TGTGATTCTT AAAAAGCTAA TAAACCTGAG CCATTTGTCA GAG CTGTAGA 1200 

50 

ATGGAAACTT GAAGTGTGAA GTGGGATAAT CCAAAGGGAT TTTTTTTTAA AGTATAGATT 1260 

CTAGCTGAGG AATTCAACAA TAAGAAAGTT GTATTTATGT AATGTTTAGT ATTTTTGAAG 1320 

55 ACTAGTGAGA TTTCTTTAAT AATTTTTACT TTGAAAGCAT AT TGTACAAA TGTTTCTTCT 1380 
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TTTGCTATTA GAAGAACATC AAGAGAAGTT TCCTTTGGTG GTTAGTTTGT TATTTCAATC 1440 

5 TAGGTTGAAT AATTTGTAAG CCTAAATGTT ATATACCACA GTTCTTTGTA GTCAGTATTT 1500 

CTCACTGGGT GATGAAACTT TTCAGCCAGT GAATGATACA TTCAATTAGT TTTTTAAAAA 1560 

TCCAAAGTTG CAGATGTATG TGGATATGTA CATAGACTTT TGCATGTATA TATACACATA 1620 

10 TATATCTTTG C CTAGAGTTT GTCAGGTTAT GTATAGAATT TCTATTAAAA AGTTTTAATA 1680 

ATGGACAAGC AATATAGGAT TGAAGTATTT ATCTCCTTTG TTTAAAATTT TGTATGTTAC 1740 

^ CAAGTTTTTA AAACAGTAAG CCAAATACTA TGTGGTACAG TTGG CTGTTA TTACACCTGA 1800 

AAAATGTTAA ATGGTGCTCA CTTGTTACGT TTGAAAATGA TGCATAACTG ACGTGTGGTG 1860 

AGAGATTTTA CCAGCTACTG TTTCACTACA TTTTAG TCAA AACAAAGTTT GTTCTTAATC 1920 

20 TTTGGTATAA AGTGTTGTAG AGAAGGCCAA GTCACAAAGT AAAGGGTGAA GGGGGAATTC 1980 

TGACATTCCA CACTAACATA ACACTGTTAT GCTTTCTTTA AAATAACTAA CCGCAAAAGA 2040 

^ AAATCTCTGA AGTAGTTTGC TGCTAATATA TACATATATT GTAAAAAAAA AAGGTATATT 2100 
TTGATTTTCT GGTAAATCTC G 



30 



40 



45 



50 



2121 



(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS; 
35 (A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



55 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7; 
CCTCGAGCTT CCTGTCC 
(2) INFORMATION FOR SEQ ID NO: 8: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



17 
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(ii) MOLECULE TYPE: cDNA 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

AUGGCGACUG AUGAGCUCAC UAGAGCGAAA GCGGCA 36 
(2) INFORMATION FOR SEQ ID NO : 9 : 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CGTGAAGGAA AAGCAGC 17 
25 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GTAGAAATCA TCTTCGG 17 
(2) INFORMATION FOR SEQ ID NO: 11: 



35 



40 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



50 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
55 TGGAAACTAC GACATGCC 



18 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 
CCUUCCGCUG AUGAGCUCAC UAGAGCGAAA AAUCUC 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



35 



40 



30 < xi > SEQUENCE DESCRIPTION: SEQ ID NO:13 : 

GTAGTTTCCA AAGAGCCC 

18 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

45 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 
CATACTCCAT GACGGCA 
50 (2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: IB base pairs 
<B> TYPE: nucleic acid 
<C> STRANDEDNESS: single 



17 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TATGATCCAG TTCTAGGG 18 
(2) INFORMATION FOR SEQ ID NO: 16: 



10 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



40 



50 



(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

25 CAAAUUCCUG AUGAGCUCAC UAGAGCGAAA GGGCAC 36 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CAAGAATGCT AATGCAG 17 
(2) INFORMATION FOR SEQ ID NO: 18: 

45 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:l8: 
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10 



15 



20 



30 



GCTTCTGGAT AACGACCC 

18 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 19: 
CGTACGGAGA GGCTGG 

16 

(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 
<B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:20: 



GCACUGCCUG AUGAGCUCAC UAGAGCGAAA AGCAAU 
jj 36 



40 



50 



(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
4 ^ (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GAGCTCCTAC ACCATCGG 

18 

55 (2) INFORMATION FOR SEQ ID NO:22: 
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10 



15 



25 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
TCAGACTCGG ACACGG 16 
(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

30 GGCGTCTTGA GCACCTCC 18 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : 1 inear 

40 (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
G AC AAUGCU G AUGAGCUCAC UAG AG CG AAA UUCCAC 36 
(2) INFORMATION FOR SEQ ID NO: 25: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 
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(ii) MOLECULE TYPE: cDNA 



<*i) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
CCTGTCAGCA CGTTTTC 

(2) INFORMATION FOR SEQ ID NO: 26- 

10 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CCCGTTCCGG ACATGCTC 

25 



ii) 



15 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



W <*i) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

CCCTCAGCGT CAGTCAGC 
(2) INFORMATION FOR SEQ ID NO 28- 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



55 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
CAGCGCACUG AUGAGCUCAC UAGAGCGAAA CCCGAA 36 
5 (2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



15 



20 



30 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
GCGGAGCCGT TGCCATGG 18 
(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
35 TCAGCGAAGG AGAAGAGG 18 



40 2) INFORMATION FOR SEQ ID NO:3l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 232 amino acids 

(B) TYPE: amino acid 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3l: 

Asn Ser Ala Arg Ala His Ser Gin Pro Trp Gin Ala Ala Leu Val Met 



10 
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1 

Glu Asn Glu Leu Phe Cys Ser Gly Val Leu Val His Pro Gin Trp Val 
5 20 25 30 

Leu Ser Ala Ala His Cys Phe Gin Lys Xaa Val Gin Ser Ser Tyr Thr 
35 40 45 

He Gly Leu Gly Leu His Ser Leu Glu Ala Asp Gin Glu Pro Gly Ser 

55 60 

Gin Met Val Glu Ala Ser Leu Ser Val Arg His Pro Glu Tyr Asn Arg 



15 



30 



35 



40 



45 



75 



80 



Pro Leu Leu Ala Asn Asp Leu Met Leu lie Lys Leu Asp Glu Ser Val 



85 



90 



95 



Ser Glu ser Asp Thr lie Arg Ser lie Ser lie Ala Ser Gin Cys Pro 
20 105 no 

Thr Ala Gly Asn Ser Cys Leu Val Ser Gly Trp Gly Leu Leu Ala Asn 
115 12 ° 125 

25 Gly Arg Met Pro Thr Val Leu Gin Cys Val Asn Val Ser Val Val Ser 

135 140 

Glu Glu Val Cys Ser Lys Leu Tyr Asp Pro Leu Tyr His Pro Ser Met 

150 15 5 160 

Phe Cys Ala Gly Gly Gly Gin Asp Gin Lys Asp Ser Cys Asn Gly Asp 
165 I'" 175 

Ser Gly Gly Pro Leu lie Cys Asn Gly Tyr Leu Gin Gly Leu Val Ser 
180 185 190 

Phe Gly Lys Ala Pro Cys Gly Gin Val Gly Val Pro Gly Val Tyr Thr 
195 2 °0 205 

Asn Leu Cys Lys Phe Thr Glu Trp lie Glu Lys Thr Val Pro Gly Gin 

215 220 



Leu Thr Leu Gly Thr Gly Asn Pro 
225 230 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
c n (A) LENGTH: 300 amino acids 

^ <B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

55 <v> FRAGMENT TYPE: internal 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

5 

Met Ala Ala Ala Ala Gly Asp Ala Asp Asp Glu Pro Arg Ser Gly His 
15 10 15 

Ser Ser Ser Glu Gly Glu Cys Ala Val Ala Pro Glu Pro Leu Thr Asp 
10 20 25 30 

Ala Glu Gly Leu Phe Ser Phe Ala Asp Phe Gly Ser Ala Leu Gly Gly 
35 40 45 

15 Gly Gly Ala Gly Leu Ser Gly Arg Ala Ser Gly Gly Ala Gin Ser Pro 

50 55 60 



20 



35 



50 



Leu Arg Tyr Leu His Val Leu Trp Gin Gin Asp Ala Glu Pro Arg Asp 
65 70 75 80 

Glu Leu Arg Cys Lys lie Pro Ala Gly Arg Leu Arg Arg Ala Ala Arg 
85 90 95 



Pro His Arg Arg Leu Gly Pro Thr Gly Lys Glu Val His Ala Leu Lys 
25 100 105 110 

Arg Leu Arg Asp Ser Ala Asn Ala Asn Asp Val Glu Thr Val Gin Gin 
115 120 125 

30 Leu Leu Glu Asp Gly Ala Asp Pro Cys Ala Ala Asp Asp Lys Gly Arg 

130 135 140 

Thr Ala Leu His Phe Ala Ser Cys Asn Gly Asn Asp Gin lie Val Gin 
145 150 155 160 



Leu Leu Leu Asp His Gly Ala Asp Pro Asn Gin Arg Asp Gly Leu Gly 
165 170 175 



Asn Thr Pro Leu His Leu Ala Ala Cys Thr Asn His Val Pro Val lie 
40 180 185 190 

Thr Thr Leu Leu Arg Gly Gly Ala Arg Val Asp Ala Leu Asp Arg Ala 
195 200 205 

45 Gly Arg Thr Pro Leu His Leu Ala Lys Ser Lys Leu Asn lie Leu Gin 

210 215 220 

Glu Gly His Ala Gin Cys Leu Glu Ala Val Arg Leu Glu Val Lys Gin 
225 230 235 240 



lie He His Met Leu Arg Glu Tyr Leu Glu Arg Leu Gly Gin His Glu 
245 250 255 



Gin Arg Glu Arg Leu Asp Asp Leu Cys Thr Arg Leu Gin Met Thr Ser 
55 260 265 270 
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10 



15 



20 



35 



45 



50 



Thr Lys Glu Gin Val Asp Glu Val Thr Asp Leu Leu Ala Ser 



275 



Phe Thr 
285 



Ser Leu Ser Leu Gin Met Gin Ser Met Glu Lys Arc 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 



Met Met Leu Gly Arg Phe Arg Glu Ala Leu Gly Asp Ala Gin Gin Ser 
25 5 10 15 

Val Arg Leu Asp Asp Ser Phe Val Arg Gly His Leu Arg Glu Gly Lys 
20 25 30 

30 ^ « U Gly ASn Ala Met Ala Ala C V S *rg Ser Phe Gin 

35 40 45 

Arg Ala Leu Glu Leu Asp His Lys Asn Ala Gin Ala Gin Gin Glu Phe 



50 55 



60 



Lys Asn Ala Asn Ala Val Met Glu Tyr Glu Lys He Ala Glu Thr Asp 

65 70 '5 eo P 

Phe Glu Lys Arg Asp Phe Arg Lys Val Val Phe Cys Met Asp Arg Ala 
40 " 90 95 

Leu Glu Phe Ala Pro Ala Cys His Arg Phe Lys lie Leu Lys Ala Glu 
100 105 110 

Cys Leu Ala Met Leu Gly Arg Tyr Pro Glu Ala Gin Ser Val Ala Ser 
H5 120 125 

Asp lie Leu Arg Met Asp Ser Thr Asn Ala Asp Ala Leu Tyr Val Arg 
130 135 140 3 

Gly Leu Cys Leu Tyr Tyr Glu Asp Cys He Glu Lys Ala Val Gin Phe 



150 



155 



160 



Phe Val Gin Ala Leu Arg Met Ala Pro Asp His Glu Lys Ale 



- - ia Q y s Ile 

55 165 170 175 



WO 98/12302 



PCT/US97/15627 



-90- 

Ala Cys Arg Asn Ala Lys Ala Leu Lys Ala Lys Lys Glu Asp Gly Asn 
180 185 190 

Lys Ala Phe Lys Glu Gly Asn Tyr Lys Leu Ala Tyr Glu Leu Tyr Thr 
5 195 200 205 

Glu Ala Leu Gly lie Asp Pro Asn Asn lie Lys Thr Asn Ala Lys Leu 
210 215 220 

10 Tyr Cys Asn Arg Gly Thr Val Asn Ser Lys Leu Arg Lys Leu Asp Asp 

225 230 235 240 



15 



Ala lie Glu Asp Cys Thr Asn Ala Val Lys Leu Asp Asp Thr Tyr lie 

245 250 255 

Lys Ala Tyr Leu Arg Arg Ala Gin Cys Tyr Met Asp Thr Glu Gin Tyr 
260 265 270 



Glu Glu Ala Val Arg Asp Tyr Glu Lys Val Tyr Gin Thr Glu Lys Thr 
20 275 280 285 

Lys Glu His Lys Gin Leu Leu Lys Asn Ala Gin Leu Lys Phe Arg Asn 
290 295 300 

25 Tyr Lys Phe Gin 

305 

(2) INFORMATION FOR SEQ ID NO: 34; 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 438 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE,: peptide 

(v) FRAGMENT TYPE: internal 



40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

Met Cys Pro Asn Asn Ala Ser Tyr Tyr Gly Asn Arg Ala Ala Thr Leu 
1 5 10 15 

45 

Met Met Leu Gly Arg Phe Arg Glu Ala Leu Gly Asp Ala Gin Gin Ser 
20 25 30 

Val Arg Leu Asp Asp Ser Phe Val Arg Gly His Leu Arg Glu Gly Lys 
50 35 40 45 

Cys His Leu Ser Leu Gly Asn Ala Met Ala Ala Cys Arg Ser Phe Gin 
50 55 60 



55 



Arg Ala Leu Glu Leu Asp His Lys Asn Ala Gin Ala Gin Gin Glu Phe 
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fi 5 70 75 



80 



Lys Asn Ala Asn Ala Val Met Glu Tyr Glu Lys lie Ala Glu Val Asd 

85 90 95 

Phe Glu Lys Arg Asp Phe Arg Lys Val Val Phe Cys Met Asp Arg Ala 
100 105 110 

Leu Glu Phe Ala Pro Ala Cys His Arg Phe Lys lie Leu Lys Ala Glu 
115 120 125 

Cys Leu Ala Met Leu Gly Arg Tyr Pro Glu Ala Gin Phe Val Al 



130 135 



a Ser 
140 



Arg 



Asp He Leu Arg Met Asp Ser Thr Asn Ala Asp Ala Leu Tyr Val 
145 150 155 160 

Gly Leu Cys Leu Tyr Tyr Glu Asp Cys He Glu Lys Ala Val Gin Phe 
20 165 170 175 

Phe Val Gin Ala Leu Arg Met Ala Pro Asp His Glu Lys Ala Cys Val 
180 185 190 

Ala Cys Arg Asn Ala Lys Ala Leu Lys Ala Lys Lys Glu Asp Gly Asn 

195 500 -*~r- 



205 



Lys Ala Phe Lys Glu Gly Asn Tyr Lys Leu Ala Tyr Glu Leu Tyr Thr 
210 215 220 



Glu Ala Leu Gly lie Asp Pro Asn Asn lie Lys Thr Asn Ala Lys 
225 230 235 



Leu 
240 



Tyr Cys Asn Arg Gly Thr Val Asn Ser Lys Leu Arg Gin Leu Glu Asp 
35 245 250 255 

Ala lie Glu Asp Cys Thr Asn Ala Val Lys Leu Asp Asp Thr Tyr lie 
260 265 270 

Lys Ala Tyr Leu Arg Arg Ala Gin Cys Tyr Met Asp Thr Glu Gin Phe 
275 280 285 

Glu Glu Ala Val Arg Asp Tyr Glu Lys Val Tyr Gin Thr Glu Lys Thr 
290 295 300 



Lys Glu His Lys Gin Leu Leu Lys Asn Ala Gin Leu Glu Leu Lys Lys 
305 310 315 7 3 £ 0 



Ser Lys Arg Lys Asp Tyr Tyr Lys lie Leu Gly Val Asp Lys Asn Ala 
50 330 335 

Ser Glu Asp Glu lie Lys Lys Ala Tyr Arg Lys Arg Ala Leu Met His 
340 3 «. 350 

His Pro Asp Arg His Ser Gly Ala Ser Ala Glu Val Gin Lys Glu Glu 
355 36 ° 365 
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10 



25 



Glu Lys Lys Phe Lys Glu Val Gly Glu Ala Phe Thr lie Leu Ser Asp 
370 375 380 

Pro Lys Lys Lys Thr Arg Tyr Asp Ser Gly Gin Asp Leu Asp Glu Glu 
385 390 395 400 

Gly Met Asn Met Gly Asp Phe Asp Ala Asn Asn lie Phe Lys Ala Phe 
405 410 415 

Phe Gly Gly Pro Gly Gly Phe Ser Phe Glu Ala Ser Gly Pro Gly Asn 
420 425 430 



Phe Tyr Phe Gin Phe Gly 
15 435 

(2) INFORMATION FOR SEQ ID NO.-35: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 245 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Met Ala Glu Asp Leu Ser Ala Ala Thr Ser Tyr Thr Glu Asp Asp Phe 
15 10 15 

35 Tyr Cys Pro Val Cys Gin Glu Val Leu Lys Thr Pro Val Arg Thr Thr 

20 25 30 



40 



Ala Cys Gin His Val Phe Cys Arg Lys Cys Phe Leu Thr Ala Met Arg 
35 40 45 

Glu Ser Gly Ala His Cys Pro Leu Cys Arg Gly Asn Val Thr Arg Arg 
50 55 60 



Glu Arg Ala Cys Pro Glu Arg Ala Leu Asp Leu Glu Asn lie Met Arg 

45 65 70 75 80 

Lys Phe Ser Gly Ser Cys Arg Cys Cys Ala Lys Gin lie Lys Phe Tyr 
85 90 95 

50 Arg Met Arg His His Tyr Lys Ser Cys Lys Lys Tyr Gin Asp Glu Tyr 

100 105 no 



55 



Gly Val Ser Ser lie lie Pro Asn Phe Gin lie Ser Gin Asp Ser Val 
115 120 125 
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Gly Asn Ser Asn Arg Ser Glu Thr Ser Thr Ser Asp Asn Thr Glu Thr 
130 135 140 



10 



Tyr Gin Glu Asn Thr Ser Ser Ser Gly His Pro Thr Phe Lys Cys Pro 

Leu Cys Gin Glu Ser Asn Phe Thr Arg Gin Arg Leu Leu Asp His Cys 
165 170 175 

Asn Ser Asn His Leu Phe Gin lie Val Pro Val Thr Cys Pro lie Cys 
180 "5 190 

Val Ser Leu Pro Trp Gly Asp Pro Ser Gin lie Thr Arg Asn Phe Val 
,5 195 20° 205 

Ser His Leu Asn Gin Arg His Gin Phe Asp Tyr Gly Glu Phe Val Asn 
10 215 



20 



30 



35 



40 



50 



55 



220 



Leu Gin Leu Asp Glu Glu Thr Gin Tyr Gin Thr Ala Val Glu Glu Phe 

230 



235 



240 



Phe Gin Val Asn He 
245 



25 (2) INFORMATION FOR SEQ ID NO: 36; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met Ser Glu Glu Leu Ser Ala Ala Thr Ser Tyr Thr Glu Asp Asp Phe 
5 10 15 



Tyr Cys Pro Val Cys Gin Glu Val Leu Lys Thr Pro Val Arg Thr Ala 
45 20 25 30 



Ala Cys Gin His val Phe Cys Arg Lys Cys Phe Leu Thr Ala Met Arg 
35 40 45 

Glu Ser Gly lie His Cys Pro Leu Cys Arg Gly Ser Val Thr Arg Arg 

55 60 

Glu Arg Ala Cys Pro Glu Arg Ala Leu Asp Leu Glu Asn He Met Arg 

70 7 * 80 

Arg Phe Ser Gly Ser Cys Arg Cys Cys Ser Lys Lys He Lys Phe Tyr 
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85 90 95 

Arg Met Arg His His Tyr Lys Ser Cys Lys Lys Tyr Gin Asp Glu Tyr 
100 105 110 

5 

Gly Val Ser Ser Val lie Pro Asn Phe Lys lie Ser Gin Asp Ser Val 
115 120 125 

Arg Ser Ser Asn Arg Ser Glu Thr Ser Ala Ser Asp Asn Thr Glu Thr 
10 130 135 140 

Tyr Gin Glu Asp Thr Ser Ser Ser Gly His Pro Thr Phe Lys Cys Pro 
145 150 155 160 

15 Leu Cys Gin Glu Ser Asn Phe Thr Arg Gin Arg Leu Leu Asp His Cys 

165 170 175 

Asn Ser Asn His Leu Phe Gin lie Val Pro Val Thr Cys Pro lie Cys 
180 165 190 



20 



Val Ser Leu Pro Trp Gly Asp Pro Ser Gin lie Thr Arg Asn Phe Val 
195 200 205 



Ser His Leu Asn Gin Arg His Gin Phe Asp Tyr Gly Glu Phe Val Asn 
25 210 215 220 

Leu Gin Leu Asp Glu Glu Thr Gin Tyr Gin Thr Ala Val Glu Glu Ser 

225 230 235 240 



30 



Phe Gin Val Asn Met 
245 
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Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routme expenmentation, many equivalents of the specific embodiments of the invention 
5 descnbed herein. Such equivalents are intended to be encompassed by the following 
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What is claimed is: 

1 . An isolated nucleic acid molecule comprising a nucleotide sequence 
which encodes an 77 polypeptide or a bioactive fragment thereof. 

5 

2. The nucleic acid molecule of claim 1 , wherein the 77 polypeptide or 
bioactive fragment thereof is a vertebrate 77 polypeptide or a bioactive fragment thereof. 

3. The nucleic acid molecule of claim 1 , wherein the 77 polypeptide 

10 comprises an amino acid sequence which is at least about 70% homologous to the amino 
acid sequence of SEQ ID NO:3 1, 32, 33, 34, 35, or 36 or the amino acid sequence 
encoded by the nucleotide sequence of the DN A insert of the plasmid deposited with 
ATCC as Accession Number 98125, 98126, 98127, or 98128. 

1 5 4. The nucleic acid molecule of claim 3, wherein the 77 polypeptide 

comprises an amino acid sequence which is at least about 80% homologous to the amino 
acid sequence of SEQ ID NO:31, 32, 33, 34, 35, or 36 or the amino acid sequence 
encoded by the nucleotide sequence of the DNA insert of the plasmid deposited with 
ATCC as Accession Number 98125, 98126, 98127, or 98128. 

20 

5. The nucleic acid molecule of claim 4, wherein the 77 polypeptide 
comprises the amino acid sequence of SEQ ID NO:3 1 , 32, 33, 34, 35, or 36 or the amino 
acid sequence encoded by the nucleotide sequence of the DNA insert of the plasmid 
deposited with ATCC as Accession Number 98125, 98126, 98127, or 98128, 

25 

6. The nucleic acid molecule of claim 1 , comprising a nucleotide sequence 
which is at least about 70% homologous to the nucleotide sequence of SEQ ID NO:l, 2, 
3, 4, 5, or 6 or the nucleotide sequence of the DNA insert of the plasmid deposited with 
ATCC as Accession Number 98125, 98126. 98127, or 98128. 

30 

7. The nucleic acid molecule of claim 6, comprising a nucleotide sequence 
which is at least about 80% homologous to the nucleotide sequence of SEQ ID NO:l, 2, 
3. 4, 5, or 6 or the nucleotide sequence of the DNA insert of the plasmid deposited with 
ATCC as Accession Number 98125, 98126, 98127, or 98128. 



35 



WO 98/12302 



PCT/US97/15627 



25 



30 



-97- 

8. The nucleic acid molecule of claim 1 , which hybridizes to the nucleotide 
sequence of SEQ ID NO:l, 2, 3, 4, 5, or 6 or the nucleotide sequence of the DNA insert 
of the plasm.d deposited with ATCC as Accession Number 98 1 25, 98 1 26 98 1 27 or 
98128. 

5 

9. The nucleic acid molecule of claim 8, further comprising a label. 

1 0. The nucleic acid molecule of claim 1 , comprising the nucleotide sequence 
of SEQ ID NO: 1 , 2, 3, 4, 5, or 6 or the nucleotide sequence of the DNA insert of the 

10 plasmid deposited with ATCC as Accession Number 98125, 98126, 98127, or 98128. 

M. The nucleic acid molecule of claim 1 , which is genomic DNA. 

12. The nucleic acid molecule of claim I, which is cDNA. 

1 3. The nucleic acid molecule of claim 1 , which is RNA. 

14. An isolated nucleic acid molecule which is antisense to the nucleic acid 
molecule of claim 10. 

1 5. An isolated nucleic acid molecule which is antisense to a coding region 
of the coding strand of the nucleic acid molecule of claim 10. 

16. An isolated nucleic acid molecule which is antisense to a noncoding 
region of the coding strand of the nucleic acid molecule of claim 10. 

1 7. A vector comprising the nucleic acid molecule of claim 1 . 

1 8. The vector of claim 1 7, which is an expression vector. 



15 



20 



19. The vector of claim 1 8, which encodes a polypeptide comprising the 
ammo acid sequence of SEQ ID NO:31, 32, 33. 34, 35, or 36 or the amino acid sequence 
encoded by the nucleotide sequence of the DNA insert of the plasmid deposited with 
ATCC as Accession Number 98125, 98126. 98127, or 98128 

35 
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20. The vector of claim 18, which comprises the coding region of the 
nucleotide sequence of SEQ ID NO: 1 , 2, 3, 4, 5, or 6 or the coding region of the 
nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98125, 98126, 98127, or 98128. 

5 

21 . A host cell containing the vector of claim 1 7. 

22. A host cell containing the expression vector of claim 1 8. 

10 23. A method for producing a 77 polypeptide comprising culturing the host 

cell of claim 22 in an appropriate culture medium to produce a 77 polypeptide. 

24. The method of claim 23, further comprising isolating the 77 polypeptide 
from the culture medium or host cell. 

15 

25. A nonhuman transgenic animal carrying the nucleic acid molecule of 
claim 1. 

26. A nonhuman homologous recombinant animal which contains cells 
20 having an altered form of the nucleic acid molecule of claim 1 . 

27. An isolated 77 polypeptide or a bioactive fragment thereof encoded by the 
nucleotide sequence of the nucleic acid molecule of claim 1. 

25 28. The polypeptide of claim 27, which comprises the amino acid sequence 

of SEQ ID NO:3 1 , 32, 33, 34, 35, or 36 or the amino acid sequence encoded by the 
nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98125, 98126, 98127, or 98128. 

30 29. A composition comprising the polypeptide or bioactive fragment thereof 

of claim 27 and a pharmaceutical^ acceptable carrier. 

30. A fusion polypeptide comprising the 77 polypeptide of claim 27 and a 
second non-77 polypeptide. 

35 

31 . An antibody which is specifically reactive with an epitope of the 77 
polypeptide of claim 27. 



WO 98/12302 



-99- 



PCT/US97/15627 



32. The antibody of claim 3 1 , which is monoclonal. 

33. Use of a nucleic acid encoding a 77 polypeptide or a bioactive fragment 
5 thereof of claim 1 for the manufacture of a medicament for treating a weight disorder in 

a subject. 

34. Use of a nucleic acid encoding a 77 polypeptide or a bioactive fragment 
thereof of claim 1 for the manufacture of a medicament for treating diabetes in a subject 

10 

35. Use of a 77 polypeptide or a bioactive fragment thereof of claim 27 for 
the manufacture of a medicament for treating a weight disorder in a subject. 

36. Use of a 77 polypeptide or a bioactive fragment thereof of claim 27 for 
1 5 the manufacture of a medicament for treating diabetes in a subject. 

37. Use of a 77 modulator for the manufacture of a medicament for treating a 
disorder characterized by aberrant 77 polypeptide bioactivity or 77 nucleic acid 
expression in a subject. 

20 

38. The use of claim 37, wherein the 77 modulator is a small molecule. 

39. The use of claim 37, wherein the disorder is a weight disorder. 
25 40 - The use of claim 39, wherein the weight disorder is obesity. 

41. The use of claim 39, wherein the weight disoder is cachexia. 

42. The use of claim 39, wherein the weight disorder is anorexia nervosa 

30 

43. The use of claim 37, wherein the disorder is diabetes. 
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44. A method for identifying a compound capable of treating a disorder 
characterized by aberrant 77 polypeptide bioactivity or 77 nucleic acid expression 
comprising assaying the ability of the compound to modulate the bioactivity of the 77 
polypeptide or the expression of 77 nucleic acid thereby identifying a compound capable 
5 of treating a disorder characterized by aberrant 77 polypeptide bioactivity or 77 nucleic 
acid expression. 



45. The method of claim 44, wherein the disorder is a weight disorder. 



10 46. The method of claim 45, wherein the weight disorder is obesity. 

47. The method of claim 44, wherein the disorder is diabetes. 



48. A method for modulating a cell associated activity comprising contacting 
1 5 the cell with a 77 modulator such that a cell associated activity is altered relative to a cell 
associated activity of the cell in the absence of the 77 modulator. 



49. The method of claim 48, wherein the 77 modulator stimulates 77 
polypeptide bioactivity or 77 nucleic acid expression. 

20 

50. The method of claim 49, wherein the 77 modulator is an active 77 
polypeptide. 



5 1 . The method of claim 49, wherein the 77 modulator is a nucleic acid 
25 encoding a 77 polypeptide that has been introduced into the cell. 

52. The method of claim 48, wherein the 77 modulator inhibits 77 
polypeptide activity or TI nucleic acid expression. 

30 53. The method of claim 52, wherein the TI modulator is an antisense 77 

nucleic acid molecule. 



54. The method of claim 52, wherein the TI modulator is an antibody which 
is specifically reactive with an epitope of a TI polypeptide. 

35 

55. The method of claim 48, wherein the cell is present within a subject and 
the TI modulator is administered to the subject. 
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5 



20 



56. A method for determining if a subject is at risk for a disorder 
characterized by aberrant 77 polypeptide bioactivity or 77 nucleic acid expression 
comprising detecting, in a sample of cells from the subject, the presence or absence of a 
genetic lesion, wherein the genetic lesion is characterized by an alteration affecting the 
integrity of a gene encoding a 77 polypeptide or misexpression of the 77 gene. 



57. A method for identifying a compound which interacts with a 77 
polypeptide comprising contacting the 77 polypeptide with the compound under 
1 0 conditions which allow binding of the compound to the 77 polypeptide to form a 
complex; and detecting the formation of a complex of the 77 polypeptide and the 
compound in which the ability of the compound to interact with the TI polypeptide is 
indicated by the presence of the compound in the complex. 

15 58. A method for identifying a compound which modulates the interaction of 

an TI polypeptide with a target molecule comprising contacting, in the presence of the 
compound, the 77 polypeptide and the target molecule under conditions which allow 
binding of the target molecule to the TI polypeptide to form a complex; and detecting 
the formation of a complex of the TI polypeptide and the target molecule in which the 
ability of the compound to modulate interaction between the 77 polypeptide and the 
target molecule is indicated by an increase or a decrease in complex formation as 
compared to the amount of complex formed in the absence of the compound. 

59. The method of claim 58, wherein the target molecule is a tub polypeptide 
25 or a portion thereof. 

60. A method for detecting the presence of TI in a biological sample 
comprising contacting a biological sample with an agent capable of detecting TI 
polypeptide or mRNA such that the presence of 77 is detected in the biological sample 

30 

61. The method of claim 60, wherein the agent is a labeled or labelable 
nucleic acid probe capable of hybridizing to TI mRNA. 

62. The method of claim 60, wherein the agent is a labeled or labelable 
35 antibody capable of specifically binding to a TI polypeptide. 
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63. A kit for detecting the presence of 77 in a biological sample comprising a 
labeled or labelable agent capable of detecting a 77 polypeptide or mRNA in a biological 
sample. 

5 64. The kit of claim 63, wherein the agent is a nucleic acid probe capable of 

hybridizing to 77 mRNA. 



65. The kit of claim 63, wherein the agent is an antibody capable of 
specifically binding to a 77 polypeptide. 

10 
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